








In This Issue 








Sometimes it pays to look at old problems in new ways and, 
instead of enjoying the comfort of the apparent correspon- 
dence of accepted theory and accumulated empirical evi- 
dence, to puzzle about the anomalies that accepted theory 
fails to explain. ““To be accepted,’ said Thomas Kuhn in 

The Structure of Scientific Revolutions, “ta theory must 
seem better than its competitors, but it need not, and in fact 
never does, explain all of the facts with which it can be 
confronted.” 


Each of the four articles in this issue addresses problems that 
have been discussed before in these pages and elsewhere. 
Each author exposes an anomaly in an accepted paradigm. 
This is good research technique; one example of failure is 
sufficient to falsify a theory. Each author then enthusias- 
tically proposes and defends an alternative procedure. The 
alternative appears to work in the instances cited. You and 

I might do well to recall, as we read of these new ideas, that 
each one need not “explain all the facts with which it can be 


confronted.” 


Baron returns to the issue of whether owner-operated and 
cash-rent farms are more efficient than share-rent farms. He 
finds that the old answers to this question may be wrong 
when risk is taken into account. He goes further and demon- 
strates that the issue cannot be resolved by pure theory. 

It is, instead, an empirical question. The answer depends on 
such variables as relative costs, relative returns, degree of 
risk, and intensity of risk aversion. 


Belongia and Dickey demonstrate that the usual procedures 
used to filter out serial-correlated disturbances from time 
series in order to test for causality will occasionally fail. 
The authors propose an alternative filter that uses an auto- 
regressive-moving-average model, and they illustrate with 

a successful application. 


Lutton addresses the problem of statistical estimates of 
production functions and of derived input demand curves. 
He explains the limitations of traditional, direct procedures 
for estimating these equations and explains how develop- 
ments in the theory of duality facilitate obtaining them. 
Because duality estimates rely on a different data set than do 


direct estimates, duality theory is sometimes the only fea- 
sible procedure for circumventing data limitations. 


LeBlanc is concerned with estimates of shares. In such prob- 
lems, the dependent variable is constrained to lie within the 
unit interval. Ordinary regression may provide estimates 
outside the required interval. Ordinary binary choice models 
only recognize the endpoints of the interval (0 and 1). 
LeBlanc illustrates the application of a multivariate logit 
framework to estimate and explain input cost shares for 
agriculture. 


Clark Edwards 
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Abstract 


Most analyses of allocative efficiency under different forms of agricultural tenure—share tenancy, 
fixed cash tenancy, and owner cultivation—employ single-product models of production. These 
models show that risk sharing encourages share tenants to produce as much as or more than 
equally risk-averse owner-operators and cash tenants. However, when risk and risk aversion are 
introduced into multiproduct linear programming models, relative allocative efficiency under 
share tenancy may decline. The result depends on the relative production costs and the relative 


risk premiums of the different products. 
Keywords 
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The 1978 Census of Agriculture reports that nearly 40 per- 
cent of all land in farms in the United States is rented and 
that approximately 86 percent of this rented acreage is owned 
by nonoperator landlords (45).' Studies by Reinsel and 
Johnson (28), Johnson (21), and Reiss (29, 30, 31) suggest 
that the contributions landlords make to the management 
of the land they rent out has declined in recent years. An 
increasing percentage of tenants simultaneously rents several 
tracts of land from several landlords rather than a single 
tract from a single landlord. Contributions made by individ- 
ual landlords to total rented acreage operated by tenants 
have declined, as have opportunities and incentives for land- 
lords to contribute entrepreneurial skills and to help finance 
production costs (28, 29, 30, 31). Tenants have expressed 
greater willingness to operate without assistance from land- 
lords. Many have indicated they prefer arrangements that 
provide them primary managerial control over farming 
operations so that they can coordinate their owned and 
rented resources effectively (6, 7,22, 28, 29). 


The increasing frequency of rental arrangements that allocate 
most managerial responsibilities to tenants has renewed in- 
terest in an issue that Alfred Marshall first raised: Is the ef- 
ficiency of resource allocation lower on tenant-operated than 
on owner-operated farmland? Many economists have argued 
that tenants who operate with little managerial or cost-sharing 
assistance from nonoperator landlords, but who pay landlords 
fixed cash rents, will farm just as efficiently as will owner- 
operators. However, if these tenants pay share rents, they 





*The author is an agricultural economist with the Natural 
Resource Economics Division, ERS. 

‘ Italicized numbers in parentheses refer to items in the 
References at the end of this article. 


will farm less efficiently than will either owners or cash tenants 
(1, 4, 11, 13, 14, 15, 16, 20, 23, 24, 26, 32, 33). 


Most analyses of farm tenancy have employed single-input, 
single-product models of production. Owner-operators and 
fixed cash tenants receive the entire value of the marginal 
product from each unit of input, and they employ inputs up 
to an amount at which marginal value product equals marginal 
input cost. Share tenants pay a share of the marginal value 
product as rent. The level of input employment at which 
their own share of marginal value product equals marginal 
input cost is lower than the equilibrium level under owner 
operation and cash tenancy. The single-input, single-product 
model, therefore, predicts that share tenants will produce 
less under equilibrium than do owner-operators and cash 
tenants.” 


Other studies of farm tenancy have applied multi-input, 
multiproduct, linear programming models. Within a range of 
output-price ratios, share tenants maximize their own net 
revenues, according to these models, by producing greater 
amounts of lower cost products and smaller amounts of higher 
cost products than owner-operators and cash tenants produce. 
Because output combinations chosen by owners and cash ten- 
ants always maximize total net revenues for the entire farm, 
combinations chosen by share tenants yield less than the 
maximum net farm revenues. The discrepancy measures the 
loss of efficiency attributable to the share rental arrange- 
ment within the linear programming model (10, 14, 17, 23, 25). 





? Many economists recommend that landlords share 
variable costs with their share tenants in the same proportion 
that they share output, As the marginal value received by the 
tenants will then equal the share of input cost they must pay, 
they will be encouraged to employ as much of the input as 
owners and cash tenants employ (1, 2, 3, 15, 16, 18, 23, 27). 
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Incorporating Risk and Risk Aversion into 
the Single- and Multiproduct Models 


Previous multiproduct programming models of farm tenancy 
have either ignored production risks or have assumed that all 
farm operators are risk neutral. They have not considered 

how risk and risk aversion might affect predictions of tenancy- 
related differences in farm planning. 


A few presentations of single-input, single-product models 
have addressed this question. Sutinen (34, 35) and Baron (5) 
showed that, although production by risk-averse operators de- 
clines as risk levels increase, this reduction is smaller under 
share tenancy than under owner operation and cash tenancy. 
Risk-sharing increases share tenants’ production relative to 
owner-operator and cash tenants’ production. The excess 

of owner-operator/cash tenants’ production over share ten- 
ants’ production that occurs under risk neutrality is, there- 
fore, always reduced, or possibly eliminated, under risk and 
risk aversion (5; 19, p. 24). If the negative effect of marginal 
product sharing under share tenancy is overcome through 
landlord supervision or through contractual specification of 
tenants’ obligations to employ veviable inputs, risk sharing 
may encourage share tenants to produce even more than 
owners and cash tenants produce (35, pp. 617-19). 


However, risk sharing may not have the same unequivocally 
positive impact on the efficiency of share tenancy relative to 
owner operation and to cash tenancy when it is examined 
within a multi-input, multiproduct programming model. 

To introduce risk and risk aversion into such a model, sev- 
eral assumptions need to be made about the attitudes of farm 
operators toward risk and toward changes in risk levels 
associated with changes in production levels. First, assume 
operators measure risk as a dispersion of gross revenue per 
acre of each crop around expected gross crop revenue per 
acre. Moreover, operators assign a cost, or risk premium, 

to this risk level. The premium equals some constant percent- 
age of expected gross crop revenue per acre. Thus, the mar- 
ginal rate of increase in the risk premium that results from 
an increase in production will be constant at all levels of 
production. Risk premiums for each crop will be included 

in the objective function as constant costs per acre of output, 
so that the function will remain linear. 


Price maps for alternative output combinations can display 
the results of parametric programming of the objective func- 
tion. Two sets of maps are presented in this study. The first 

is derived for a risk-neutral owner-operator and cash tenant 
and for a risk-neutral share tenant through maximization of 
the standard risk-neutral objective function presented in pre- 
vious studies. This set is compared with a second set of price 
maps derived for risk-averse owner-operators and cash tenants, 
and for risk-averse share tenants through maximization of the 
risk-adjusted or certainty-equivalent objective function defined 
here. The comparison shows that the introduction of risk and 


risk aversion into the programming model may either increase 
or decrease the likelihood that an inefficient share tenant plan 
will be chosen, depending on the relative prices, relative input 
costs, and relative risk premiums per acre of soybeans and 
corn. That is, contrary to the conclusion of other studies 
reviewed above, risk and risk aversion may either augment or 
offset the negative effect of product-sharing on the efficiency 
of share tenancy relative to owner operation and cash tenancy. 


Price Mapping Under Risk Neutrality 


The programming model considers alternative production 
plans on a 200-acre farm in Illinois that produces soybeans or 
corn. Cost estimates for this farm are available from the Firm 
Enterprise Data System (FEDS) developed by the Economic 
Research Service and Oklahoma State University (12). 


The most recent FEDS estimates, based on data from the 
1977 crop year, suggest that variable costs per acre of corn are 
approximately twice as high as those for soybeans. These esti- 
mates are derived from a production function which utilized 
approximately 30 inputs. For convenience, the programming 
model used here assumes that land is the only resource in 
limited supply. 


A simplified linear programming model of farm planning by 
a risk-neutral operator can be represented by the equation 
below. 


Maximize: 

I= (rp; ~ ¢) ay + (rpg - he) ag 
subject to: 

a, + ag = 200 acres 


where | is net farm income; p; and pg are gross revenues per 
acre of soybeans and corn, respectively; r is the share rental 
rate, which equals 1 for owner-operators and cash tenants 
and some fraction less than 1 for share tenants; a, and ag are 
total soybean and corn acreages; c and he are costs per acre 
of soybeans and corn; and h is the ratio of per acre costs of 
corn to soybeans, The constraint equation indicates that 

200 acres of land are available. 


Parametric variation of the coefficients, rp; ~ ¢ and rpg ~ he, 
defines price maps which indicate the combinations of soy- 
bean and corn acreages that yield maximum profits for farm 
operators at any given ratio of gross soybean revenue to gross 
corn revenue. As land is the sole resource constraint, only 

two plans may be optimal at any given ratio: “soybeans only” 
or “corn only.” 


The price map for the risk-neutral owner-operator and cash 
tenant is represented by AB in figure 1. In this situation, 








Figure 1 





Price Map for Risk-Neutral Farmers 
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r= 1. The axes measure gross revenues per acre of corn and 
soybeans. For all per-acre revenue combinations which fall 
to the left of AB, owner-operators and cash tenants receive 
higher net revenues per acre from soybeans than from corn. 
Thus, they produce soybeans only. For all combinations to 
the right of AB, they receive more from corn than from soy- 
beans, and they produce corn only. 


For risk-neutral share tenants, assume that r equals 0.6 and 
that the tenants receive no cost sharing from landlords, so 
that their variable costs remain at c per acre of soybeans and 
at he per acre of corn.’ The share tenants’ price map is 


defined by line CD, which is the new boundary between the soy- 


beans-only and corn-only farm plans. Revenue combinations 
within the area bounded by AB and CD define a different 





°> This assumption may be challenged on grounds that 
proportionate cost sharing by U.S. landlords is quite com- 
mon. However, Berry (6, 8, 9) and Harwell and Strickland 
(14) report that, in the Great Plains Region, many share- 
rental landlords and tenants avoid cost sharing because they 
consider it a partnership arrangement which demands pre- 
cisely the type of landlord participation in farm manage- 
ment they want to avoid. These landlords and tenants 
typically choose share leases which assign most management 
responsibilities to tenants, including responsibility for financ- 
ing variable costs. Reiss (29, 30, 31) reports that even in 
Illinois, where cost sharing has been widely practiced, a trend 
has developed toward greater use of net share leases-—that is, 
share leases which assign all variable costs to tenants. Again, 
the motivation may be enter preference for leases which 
assign landlords little role in farm management or in the 
financing of production costs. 


optimal plan for share tenants than for owner-operators and 
cash tenants. Combinations falling to the left of AB and to 
the right of CD result in the adoption of the same plan by 
all operators. 


Maps AB and CD define ratios of gross corn to gross soybean 
revenue at which net soybean and corn revenues are equal. 
CD is further right than AB, indicating that share tenants 
require a higher ratio of corn revenue to soybean revenue 
before they will switch from soybeans to corn. To determine 
why CD is further right, observe that the equality between 
net soybean and corn revenues per acre is: 


rp; ~- C=rpg- he 
which implies that: 
Py =Pg- e(h- 1)/r (1) 


Under owner operation and cash tenancy, r = 1; equation (1) 
indicates that gross revenue per acre of corn (po) must exceed 
gross revenue per acre of soybeans (p,) at all points on AB by 
exactly the amount that cost per acre of corn exceeds cost 
per acre of soybeans (c(h - 1)). Under share tenancy, r = 0.6; 
equation (1) indicates that gross corn revenues per acre must 
exceed gross soybean revenues per acre at all points on CD 

by approximately 1/0.6 (or 1.67) times the cost differential. 


The price maps also indicate why allocative efficiency is 
lower under share tenancy than under owner operation and 
cash tenancy. Net soybean and corn revenues received by 
owner-operators and cash tenants, p; - c and pg ~ he, are 
identical to corresponding net crop revenues generated for 
the whole farm. Therefore, the farm plan that owner- 
operators and cash tenants select always maximizes total 

net farm revenues. Net revenues received by share tenants, 
0.6p,; - c and 0.6pg9 ~ he, are less than the net crop revenues 
received by the entire farm. Where the share tenant’s farm 
plan differs from the owner-operator and cash tenant’s farm 
plan, as is the case within region ABCD, total net farm 
revenues generated by the share tenant’s plan will necessarily 
be less than the maximum possible. As an example, con- 
sider point F. Here, p, = $230, po = $310, c = $60, and h = 2. 
The share tenant’s plan—200 acres of soybeans—yields 

total net farm revenues of $34,000. A maximum possible 
net revenue of $38,000 is generated by the owner-operator 
and cash tenant’s plan—200 acres of corn. 


Price Mapping Under Risk and Risk Aversion 


The introduction of risk and risk aversion into the linear 
programming model requires adjustments in the objective 
function. Gross revenues per acre are uncertain, expected 
values. The operator’s reaction to risk is measured as 
reductions in expected gross soybean and corn revenues per 
acre. Assume that these reductions, or risk premiums, equal 





constant percentages of expected gross revenues. Thus, the 
risk premium from 1 acre of soybeans equals the given per- 
centage rate multiplied by the expected gross soybean reve- 
nue per acre. 


With the risk premiums, the objective function becomes: 
I= rp a, ~ ca; ~ r6,p,a; ™ rpo9ao ~ hea; = rO5P9a9 (2) 


Risk premiums per acre of soybeans and corn are 4,p, and 
9oPo, respectively. 8; and 45 are the risk premium coeffi- 
cients; they indicate the percentages of expected gross soy- 
bean and corn revenues deducted to account for risk. 


The assumption that risk premiums per acre remain constant 

as gross crop revenues per acre and total acreages increase may 
appear restrictive. Risk premiums per acre may in fact be 
highly correlated with gross crop revenues. Moreover, if covar- 
iance between gross soybean and gross corn revenue is nonzero, 
risk premiums per acre may vary as output ratios vary. 


However, if risk premiums per acre are expressed as func- 
tions of gross revenues and as functions of output ratios, 
the objective function will be nonlinear. This nonlinearity 
will make the analysis more complicated, but little will be 
gained conceptually. The linear objective function of equa- 
tion (2) is sufficiently complex to establish the fundamental 
point of the analysis—namely, that share tenancy may com- 
pare even less favorably with owner operation and cash ten- 
ancy under risk and risk aversion than under risk neutrality. 


Parametric variation of certainty-equivalent net soybean and 
corn revenues per acre, rp; - c- r0,p, and rpg - he - rOopo, 
defines price maps for risk-averse farm operators. Two sets of 
these maps are derived. In the first, 0; is assumed to be 
greater than 5, indicating that operators consider soybeans 
riskier than corn. In the second, @o is greater than 0, 
indicating that corn is riskier than soybeans. 


Case 1: 6, > 0, 


Whereas maps AB and CD (fig. 1) equate expected net soy- 
bean and corn revenues per acre, price maps for risk-averse 
farm operators (fig. 2) equate certainty-equivalent net soy- 
bean and corn revenues per acre. When 4; > 09, equivalency 
is satisfied on map A’B’ for owner-operators and cash tenants 
and on map C’D’ for share tenants. For purposes of com- 
parison, maps AB and CD are also shown in figure 2. To the 
right of A’B’, risk-averse owners and cash tenants receive 
higher certainty-equivalent net revenues from corn than from 
soybeans; therefore, they produce only corn. To the left 

of A’B’, they receive more from soybeans than from corn 
and produce only soybeans. Similarly, to the right (left) 

of CD’, share tenants receive higher certainty-equivalent 
revenues from corn (soybeans) than from soybeans (corn); 
therefore, they produce only corn (soybeans) at all points. 


Figure 2 





Price Map for Risk-Averse Farmers When Soybeans 
are Considered Riskier than Corn 


Gross revenue per acre 
of soybeans, p, 


All operators 
produce soybeans 


(1-65) c(h — 1) 
(1-6) 


Pp’; = Po 
(1-6) 


= ae 
= p,(1-82)  ah-!) 
(1-@;) (1-64) 


P’y 


All operators 
produce corn 


| | 
220 260 310 320 360 


Gross revenue per acre of corn, po 











Equality of certainty-equivalent net revenues per acre, 
rp; ~- c~ r0;p, = rpg - he - rAgpo, implies that the value 
of p; on A'B andC D’, hereinafter Pi, must be: 


p} = po(1- 8,)/(1- 0,)- e(h- 1)/r(1- 8,) (3) 
This compares with a value of: 
Py = Pg~ c(h- 1)/r (4) 


on maps AB and CD. Note that when 0; > 49, the slope of 
A'B' andC'D’ (1- 99/1 - 6;) is slightly greater than the slope 
of AB and CD, which equals 1. However, the vertical inter- 
cept of AB’ and C’D’ (-c(h - 1)/r(1 - 0,)) is smaller than 
that of AB and CD (-c(h- 1)/r). It follows that Py in equa- 
tion (3) is greater than, equal to, or less than p, in equation 
(4) and that A’B’ and C’D’ therefore lie above, intersect 

with, or fall below AB and CD, depending on whether: 


Po(1~ O,)/(1- 64)~ po Seth - 1)/r(1 - 43) 


“en = Tir 


or, to simplify, on whether: 


Po = (0/0; - Ao)(e(h - 1)/r) 








The more 0, exceeds 45 and the smaller h is, the greater is 
the value of pg relative to (0; /0, - 89)(c(h- 1)/r) and the 
greater are the values of p; on A’B’ and C’D’ relative to p, 
on AB and CD, respectively. 


Suppose the terms in equation (6) are limited to a range of 
values within which po > (0; /8; - 89)(e(h - 1)/r) and, thus, 
P, > P,. For example, assume that input costs per acre, 

c and he, approximate the recent FED estimates: $60 for 
soybeans and $120 for corn. Also assume that 0, exceeds 

5 significantly. Reasonable values might be: 6; = 0.04 per 
$1 of expected gross soybean revenue, and 45 = 0.01 per 

$1 of expected gross corn revenue. It follows from equation 
(6) that p; > p, for all py > (4/3)(60/4). Under owner 
operation and cash tenancy, Pj on A’B’ will exceed p; on 
AB, and thus A’B’ will lie above AB (fig. 2) for all cases where 
P> > $80. Under share tenancy, with r = 0.6, p; on C’D’ will 
exceed p; on CD, and thus C'D’ will lie above CD, for all 
cases where po > $133.33. Furthermore, given that during 
the 1974-80 period, the minimum reported gross corn reve- 
nue in Illinois was about $220 per acre, Pi > P} is the only 
relevant case. Therefore, figure 2 shows only those portions 
of maps A’B’ and C’D’ that lie above AB and CD. 


For an intuitive explanation of why A’B’ and C’D’ lie above 
AB and CD, consider the equality of expected net crop reve- 
nues that occurs on AB and CD. Recall that this equality 
implies that p) = pg - c(h~- 1)/r. The risk premium per 

acre of soybeans on AB and CD is 6) p; = 9;(pg- c(h ~ 1)/r). 
Suppose c = $60, he = $120, 0, = 0.04, and 05 = 0.01. Then 
9, p, is greater than opp (the risk premium per acre of corn) 
at all points on AB where po > $80 and at all points on CD 
where po > $133.33. At all such points on AB and CD, cer- 
tainty-equivalent net revenues are lower for soybeans than 
for corn. For certainty-equivalent net crop revenues to be 
equal, as is the case on maps A’B’ and C’'D’, expected net 
soybean revenue per acre must exceed expected net com 
revenue per acre to offset the higher risk premium per acre 
of soybeans. Thus, for each combination of expected gross 
soybean and expected gross corn revenues per acre defined 
by a point on AB or CD, the point on A’B’ or C’D’ that defines 
the same gross corn revenue per acre must have a higher gross 
soybean revenue per acre. 


We can also show that p; on A’B’ exceeds p; on AB by more 
than the amount that p} on C’D’ exceeds p; on CD. That is, 

for any Pg, the shift from AB to A'B’ exceeds the shift 

from CD to C’D’. If Pp} is expressed as in equation (4), then: 


P; - Py = Po(9, - O,)/(1- 6,) - 0, c(h - 1)/x(1 - 95) (7) 


which is greater under owner operation and cash tenancy, 
with r = 1, than under share tenancy, with O< r< 1. 


Suppose gross corn revenue is $320 per acre, 0; = 0.04, 
95 = 0.01, r = 0.6, and input costs per acre are again $60 for 


soybeans and $120 for corn. Gross soybean revenue then 
increases from p; = $260 per acre on map AB to P} = $267.50 
per acre on map A’B’, but only from $220 per acre on map 
CD to $225.83 per acre on map C’D’. The risk-induced shift 
from AB to A’B’ is ($267.50 - $260) - ($225.83 - $220) = 
$1.67 greater than the shift from CD to C’D’. 


Thus, when po, c, and hc have reasonable values and when 0, ex- 
ceeds 44 by enough to ensure that equal expected net corn 
and soybean revenues per acre imply higher risk premiums 
per acre for soybeans than for corn, risk and risk aversion 
will affect farm planning under owner operation and cash 
tenancy more than under share tenancy. Share tenancy’s 
standing relative to owner operation and cash tenancy 
should also improve. Recall that, under risk neutrality, the 
crop plan chosen by owners and cash tenants is always 
optimal in the sense that it maximizes total expected net 
crop revenues for the entire farm. At all points to the left of 
AB, this optimal plan is 200 acres of soybeans; at all points 
to the right, it is 200 acres of corn. 


To the left of A’B’ and to the right of C’D’ in figure 2, all 
risk-averse farmers do in fact choose optimal farm plans. 
Within region A'B’ AB, however, only risk-averse share tenants 
choose the optimal plan—200 acres of soybeans; within 
ABC'D’, only risk-averse owners and cash tenants choose 

the optimal plan—-200 acres of corn. 


Within A’B’ AB, the risk-averse share tenant’s plan is more 
profitable to the farm than the risk-averse owner and cash 
tenant’s plan.* This situation contrasts with the case of 
risk neutrality (fig. 1). Recall that the risk-neutral share 
tenant’s plan can be equally profitable at best, but can never 
be more profitable than the owner and cash tenant’s plan. 
Moreover, recall that within region ABCD, the risk-neutral 
share tenant’s plan is less profitable to the farm than the 
risk-neutral owner and cash tenant’s plan. Thus, ABCD is 
analagous to ABC’D’, which encompasses an area where 
the risk-averse share tenant’s plan is less profitable to the 
farm than the risk-averse owner and cash tenant’s plan. 
Observe that ABC’D’ is smaller than ABCD. That is, the 
number of soybean-corn revenue combinations that result 
in a suboptimal share tenant’s plan is smaller under risk 





* The objective function for risk-averse share tenants who 
receive proportionate cost share payments from landlords is 
0.6 (pya, — Cay — 01 P, a, + Pgag — hcag - 6gPgagq). Maximi- 
zation of this function is equivalent to maximization of the 
risk-averse owner-operator and cash tenant’s objective func- 
tion at all crop revenue combinations. Within ABA’B’, the 
crop plan chosen by risk-averse share tenants who receive 
proportionate cost share payments as well as by risk-averse 
owners and cash tenants is less profitable to the farm than 
the plan chosen by risk-averse share tenants who receive 
no-cost share payments. This result contradicts the claim 
of many economists (discussed in footnote 2) that propor- 
tionate cost sharing should always increase allocative effi- 
ciency under share tenancy. 


a 





and risk aversion than under risk neutrality. The prob- 
ability that the revenue combination in any given year will 
result in a suboptimal share tenant’s plan is also smaller 
under risk and risk aversion than under risk neutrality. 


Therefore, when risk and risk aversion are incorporated into 
the programming model under the assumption that soybeans 
are riskier than corn (9; > 09), share tenancy should compare 
more favorably with owner operation and with cash tenancy 
than it does in the absence of risk and risk aversion. The 
probability that the share tenant’s plan will be less profitable 
to the farm than the owner-operator and cash tenant’s plan 
should decline. The probability that the share tenant’s plan 
will be more profitable should increase from zero to some 
positive value. 


Case 2: 6, > 0; 


The results are much different if operators consider corn 
riskier than soybeans. For 05 > 0;, parametric variation of 
certainty-equivalent net soybean and corn revenues per 
acre, rp; ~ c~ r9,p, and rpg - he - roo , defines price 
maps A’B” and C’'D” (fig. 3). On these maps, as on maps 
A’B’ and C’D’, certainty-equivalent net crop revenues are 
equal; therefore, the value of p; on A’’B” and C’'D"’, here- 
inafter Pi is: 


P= Po(l~ O9)/(1- 6,)-eth-1)mA- 9,) (8) 


Figure 3 





Price Map for Risk-Averse Farmers When 
Corn is Considered Riskier than Soybeans 
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Note that with 05 > 0}, the slope of AB” and CD", 

(1 - 49)/(1 - 9}), is slightly less than 1, the slope of AB and 
CD. Furthermore, the vertical intercept of A"B” and CD” 
(-c(h - 1)/r(1 - 9;)) is less than the intercept of AB and CD 
(-c(h - 1)/r). Thus, p} will fall below p;, and AB” and 
C’’D” will fall below AB and CD, for all po. Note also that 
the more 45 exceeds 0, and the larger h is, the more p; 
exceeds Pi and the greater the gap is between the price maps, 
for any Po. Furthermore, p; on CD always exceeds P1 on 
CD" by more than p, on AB exceeds p on A’'B”. That is, 
the shift from CD to C’’D”’ everywhere exceeds the shift from 
AB to AB". If p; is expressed as in equation (4), then: 


ee Pi = P9(92 = 6/0 = 95) 
+ 6,¢(h - 1)/r(1 - 6}) (9) 


which is higher under share tenancy than under owner opera- 
tion and cash tenancy.° 


For example, suppose gross corn revenue again equals $320 
per acre, while 0; = 0.01, 05 = 0.04, and input costs per acre 
again equal $60 for soybeans and $120 for corn. Gross soy- 
bean revenue must then decline from $220 on map CD to 
$209.29 on map C’’D”’, but only from $260 on map AB to 
$249.70 on map A’’B’’. The risk-induced shift from CD to 


Malt 


C’’D” is $0.41 greater than the shift from AB to A’’B”’. 


Thus, given any values of pj, po, c, h, 1, and 02, if Oo >), 
risk and risk aversion will always affect farm planning more 
under share tenancy than under owner operation and cash 
tenancy. Moreover, the impact of risk and risk aversion 

on the comparative efficiency analysis, if 99 > 01, is quite 
different than the impact if 0, > 5. When 05 > 01, we 
compare A’B’"C’’D” and ABCD (fig. 3). A’'B’'C’'D”’ defines 
combinations of expected gross soybean and corn revenues 
per acre where risk-averse owner-operators and cash tenants 
(but not risk-averse share tenants) choose the optimal, profit- 
maximizing plan—200 acres of corn. Region ABCD defines 
gross revenue combinations where risk-neutral owners and 
cash tenants (but not risk-neutral share tenants) choose 

this same optimal plan. Because the shift from CD to C’’D” 
exceeds the shift from AB to A’'B”’, the number of soy- 
bean-corn revenue combinations falling within A’B’'C’’D” 
exceeds the number falling within ABCD. That is, the share 
tenant’s farm plan is less profitable to the farm than the 
owner and cash tenant’s plan at more revenue combinations 
under risk and risk aversion than under risk neutrality. Every- 
thing else being equal, the probability that the revenue com- 
bination that occurs in any given year will fall within 
A’B''C''D" will exceed the probability that it will fall within 





‘ Note that for any 09 > 61, the excess of @gp2 over 0, P1 
will decline, and the gap between A’’B"’ and AB and between 
C''D” and CD will narrow as pj and pp» decline. 








ABCD. Thus, the probability that the share tenant will 
actually choose a suboptimal plan will also be higher under 
risk and risk aversion than under risk neutrality.° 


Comparison of Crop Risks 


This analysis has shown that the impact of risk and risk 
aversion on the comparative efficiency analysis presented 

by the programming model depends on the values of the risk 
premium coefficients assigned to the alternative crops. This 
result raises the question of which of the two sets of values 
assumed in this article better represents the attitudes of 
farm operators toward the riskiness of soybeans and corn. 


One answer is suggested by estimates of the variability of 
gross soybean and corn revenue that can be derived from 
recent average annual yield and price data provided by 
USDA’s Crop Reporting Board (37, 38, 39, 40, 41, 42, 43, 
44). Let us take Illinois as an example. A sample of seven 
gross corn and soybean revenues per acre of Illinois cropland 
from the 1974-80 period indicates recent variations. The 
sample standard deviation was approximately $39 per acre 
for corn and $33 per acre for soybeans. Corn, therefore, 
exhibited greater variability than soybeans. Thus, a typical 
Illinois farm operator would likely consider corn as the 
riskier crop. The assumption that 09 > 0, appears to be more 
appropriate than the assumption that 0; > 09. 


If so, the conclusion for case 2 applies. Share tenancy will 
likely compare less favorably with owner operation and cash 


tenancy under risk and risk aversion than under risk neutrality. 


Conclusion 


Previous analyses of the comparative efficiency of different 
farm tenancies under risk and risk aversion have relied ex- 
clusively on single-input, single-output models. When risk and 
risk aversion are introduced into these models, the efficiency 
of share tenancy relative to owner operation and cash tenancy 
always increases. 


When the comparative efficiency analysis is extended to multi- 


product programming models, however, the results can be 
quite different. Under risk neutrality, share tenancy compares 
unfavorably with owner operation and cash tenancy. Within a 
certain range of crop revenue combinations, share tenants 





* It is conceivable, if improbable, that despite the greater 
number of revenue combinations within A’’B’’C’'D” than 
within ABCD, the probability that the combination in any 
given year will fall within ABCD still exceeds the probability 
that it will fall within A’’'B’'C’'D"’. The probability that the 
share tenant will choose an inefficient plan will then be lower 
under risk and risk aversion than under risk neutrality. The 
introduction of risk and risk aversion into the model will 
offset, rather than augment, the distortive effect of product- 
sharing on the relative efficiency of share tenancy. 


choose a crop plan which is less profitable to the farm as a 
whole than the plan owner-operators and cash tenants choose. 


To determine the impact of risk and risk aversion on this 
comparative efficiency analysis, we examined the implica- 
tions of two different assumptions concerning reactions of 
farm operators to income variability. Case 1 assumed that 
farm operators assign a higher risk premium to $1 of gross 
soybean revenue than they assign to $1 of gross corn revenue, 
even though the variable production cost per acre of corn 

is higher than the cost of soybeans. The number of soybean- 
corn revenue combinations that call for a suboptimal share 
tenant’s plan is then lower under risk and risk aversion than 
under risk neutrality. Some revenue combinations even re- 
sult in a risk-averse share tenant’s plan that is more profitable 
than the risk-averse owner-operator and cash tenant’s plan. 
The probability that the share tenant’s plan will be less 
profitable than the owner and cash tenant’s plan is unequiv- 
ocally lower under risk and risk aversion than under risk 
neutrality. 


In case 2, risk and risk aversion were introduced under the 
assumption that the risk premium per $1 of gross corn 
revenue is higher than the risk premium per $1 of gross soy- 
bean revenue. The number of soybean-corn revenue combina- 
tions that result in a suboptimal share tenant’s plan is now 
greater under risk and risk aversion than under risk neutrality. 
Therefore, the probability that the share tenant’s plan in any 
year will be less profitable than the owner and cash tenant’s 
plan is also greater under risk and risk aversion than under risk 
neutrality. 


An empirical review of crop and yield data for Dlinois sug- 
gests that the second of the two assumptions—that is, that 
the risk premium per $1 of corn is greater than the risk 
premium per $1 of soybeans—more accurately represents 
Illinois agriculture. As the analysis in case 2 appears unfavor- 
able to share tenancy, a contrast between the single-input, 
single-output model and the multiproduct programming 
model becomes obvious. Although the presence of risk and 
risk aversion always improves share tenancy’s standing relative 
to owner operation and cash tenancy under the first model, 
it probably weakened share tenancy’s relative standing under 
the second model, at least for Illinois agriculture during the 
late seventies. 


Multiproduct models may generate different results for other 
regions and other time periods. Where the analysis in case 1 
applies, share tenancy will compare more favorably with 
owner operation and cash tenancy under risk and risk aversion 
than under risk neutrality. When risk and risk aversion are 
relevant, the comparative efficiency of alternative farm 
tenancies can no longer be determined by theoretical con- 
siderations alone. Comparative efficiency can be determined 
only after one has assigned observed or reasonable values to 
the model parameters. 
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Prefiltering and Causality Tests 
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Abstract 


If data series are not filtered properly prior to the construction of a test of causality, the resulting 
test statistics are invalid. This article describes a general approach to data filtering based on the 
estimation of autoregressive-moving-average models and on specific tests for the identification of 
white noise processes. For selected examples, traditional approaches to filtering do not perform 


as well as the general method proposed. 
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Statistical techniques developed for testing whether the 
behavior of one variable causes a subsequent change in the 
value of another variable have been widely used by econo- 
mists in recent years. The tests are often used to determine 
whether one variable can be treated as exogenous with respect 
to another. The tests are also used as a form of pretest esti- 
mation in determining whether a regressor makes a signifi- 
cant contribution to the explanation of the variation in a 
dependent variable. 


Although causality tests have become more widely used in 
recent years, many practitioners disagree over the validity 
of the results of these techniques.’ This article reviews some 
empirical issues relevant to the use of causality tests and the 
necessary documentation of the procedures employed. Be- 
cause the type of test employed in much applied research 
depends crucially on proper filtering of the data prior to 
testing, we will focus on problems that are likely to occur 
if economic data are transformed by the filter suggested by 
Sims, but the results are not checked against some white 
noise test. 





* Belongia is an economist with the National Economics 
Division, ERS, and Dickey is associate professor of statistics 
at North Carolina State University, Raleigh. 

’ The problem of detecting causal relations can be especially 
acute in bivariate models if the variables are likely to share a 
common relationship with a third variable; this problem is 
discussed at length in (8), (9), and several other papers. (Note: 
Italicized numbers in parentheses refer to items in the Refer- 
ences at the end of this article). Schwert (11) has argued that 
Box-Jenkins models may not be causal-preserving so that the 
use of autoregressive-moving-average (ARMA) model residuals 
in causality tests makes those tests subject to a potential 
errors-in-variables problem. That is, ‘‘if the original variables are 
measured with random errors, causality tests based on the 
estimated innovations series could fail to detect relationships 
that would be detected using the untransformed data.” A 
recent review of alternative testing procedures and their limi- 
tations is summarized in (/). 


We first outline the intuition supporting causality tests and 
discuss criticisms raised by Feige and Pearce regarding the 
testing procedures used in much of the applied literature (4). 
We then illustrate the potential problems associated with 
applying the Sims filter to economic data with examples 
using economic time series frequently employed in studies 
of inflation and price change. We discuss an alternative ap- 
proach to data filtering and diagnostic checks for white 
noise that uses an autoregressive-moving-average (ARMA) 
transformation. Finally, we make suggestions regarding the 
future use of causality tests. 


Causality Tests: An Overview 


Granger (6) defines econometric causality as follows: “Y;, is 
causing X; if we are better able to predict X; using all avail- 
able information than if information apart from Y; has been 
used.” Or more simply, Y; causes X; in the econometric sense 
if and only if one can predict X; better by using past values 
of Y; and X; rather than by basing the prediction on the past 
history of X; alone. Sources other than the original papers 
by Granger and Sims provide details on the mechanics of 
applying such a test (see (2)). The general idea is to estimate 
regressions with and without the additional information con- 
tained in Y. From these regressions one can construct a joint 
F-test on the significance of the coefficients associated with 
future values of Y. A significant F value would then suggest 
the presence of a causal relationship between X and Y. To 
check for feedback—or whether causation also runs from Y 
to X—one can repeat the test by regressing Y on past and 
future values of X. If neither joint F-test on the coefficients 
associated with future values of X or Y is significant, one can 
conclude that the two variables are unrelated. 


The importance of filtering the data prior to estimating the 
regression models and calculating the joint F-statistic is 
related to the spurious regression problem described by 
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Granger and Newbold. Data filtering is intended to remove 
the serial correlation present in most time series. If filtered 
properly, each resulting time series used in a test of causality 
will be a white noise process so that any significant relation- 
ships represented by the joint F-test will be based on actual, 
systematic relationships between the two series instead of on 
a spurious relationship caused by the common serial correla- 
tion. 


Adequate data filtering is essential to the validity of causality 
tests because the failure to remove serial correlation from the 
data will bias the estimates of coefficient variances. Because 
most time series share a similar pattern of serial correlation, 
the variances are likely to be biased downward. This down- 
ward bias will result in artificially large t statistics associated 
with individual coefficients and in a correspondingly large 
joint F-statistic for any causality test. Failure to filter the 
data adequately may leave some common serial correlation 
in the data that will bias the test statistics upward and possibly 
suggest a significant causal relationship where, in fact, none 
exists. 


Data Filtering 


The problem of serial correlation might be turned into a 
benefit if a common pattern among time series makes it pos- 
sible for one filter to transform successfully most economic 
series to the white noise processes required by the testing 
procedures. Such a filter would standardize the mechanics 
of testing procedures and let researchers conduct their tests 
of interest after a routine data transformation. Although no 
universal filter has been found, some applied researchers 
apparently believe that such a filter exists. This belief creates 
a problem for those who apply and interpret the results of 
causality tests. 


Belief in a universal data filter is probably the result of a 
statement that Sims made in his original article.” In a study 
which investigated the causal relationships between gross 
national product (GNP) and different measures of the money 
supply, Sims advocated filtering the data by the following 
transformation: 


Z = InX; - 1.5 nX;_, + 0.5625 InX;_» 


which is an expansion of (1 - KL)? where L is the lag opera- 
tor and K = 0.75. He said this filter ““approximately flattens 
the spectral density function of most economic time series, 
and the hope was that regression residuals would be very 
nearly white noise with this prefiltering” (12). As we shall 
see later, the spectrum of a white noise process will be flat 





“A similar problem results if the data are expressed as first 
differences, a data transformation suggested by Box and Jen- 
kins as a means of detrending a time series (3). 


and its plot can be used as one diagnostic check for the ade- 
quate filtering of data. 


The (unintended) result of Sims’ statement has been for 
researchers to apply his filter to a wide variety of economic 
time series without subsequent checking for whether this 
transformation actually has created a new white noise proc- 
ess. As the following examples will illustrate, many common 
economic time series are not transformed to white noise by 
Sims’ filter. This result is not wholly unexpected in view of 
the numerous and volatile shocks represented in the eco- 
nomic data since 1972 when his article was published. But, 
unless the data are transformed to white noise processes, the 
results of a causality test are invalid. 


Sims’ Filter Applied to Some Common 
Economic Series 


Prior to the description of an alternative approach to data 
filtering, it may be helpful to illustrate the potential prob- 
lems associated with not testing filtered data to determine 
whether the transformation has reduced a series to a white 
noise process. For this purpose, the following variables have 
been chosen: the narrowly defined money stock (Ml), average 
wage rates for the manufacturing sector (W), and the GNP de- 
flator (DEF).° Each series is monthly from January 1961 
through December 1977. Table 1 presents descriptive sta- 
tistics for each series—prior to transformation and after trans- 
formation by the Sims filter. 


The spectra for these series are plotted in figures 1-3. The area 
under a spectral plot is the variance of the data series. Because 
we have moved from the time domain to the frequency 
domain, the frequencies represented on the horizontal axis 
are measured from - II to II radians; however, because the 
plot is symmetric about zero, only the area from 0 to II is 
shown.’ 


Visually, the spectrum identifies any spikes in the plot asso- 
ciated with particular frequencies between 0 and II. The 
presence of a spike in the spectral density at a particular fre- 
quency suggests that a relatively larger share of the series’ 
variance is explained by that frequency. If the series is a 
white noise process, the spectrum should not contain any 
spikes because no one frequency would contribute more than 
any other to the explanation of variance. A visual check of 
the spectrum can be supported by the Kolmogorov-Smirnov 
and Fisher tests for white noise; these tests are described in 
several texts (see (5)). 





>The monthly GNP deflator series was provided by Data 
Resources, Inc. 

* The spectra for these series were estimated with triangular 
weights of the form 1-2-3-2-1 and 1-2-3-4-3-2-1; the results did 
not vary with the choice of weighting scheme. Issues associated 
with the choice of lag window and the problem of leakage are 
discussed in most time-series texts. 





Table 1—Descriptive statistics for variables’ 





Standard 


"sire Minimum | Maximum 
deviation 


Variable Mean 














In levels 


0.230 0.688 
57.168 144.000 
1.022 2.280 


.001 -.004 
Transformed by Sims’ filter 
DEF -.005 .015 -.025 


Ml 335 .024 .284 
W .364 .019 333 








1 Number of observations = 204. 


The Sims filter did not transform any of the three series to 
white noise. The spectral plots display a result common to 
most economic series—that is, a large spike in the spectrum 
at low frequencies near the origin. A spectrum of this shape 
suggests that much of the variance of the series can be ex- 


plained by a strong trend in the data. However, the intention 


of the filtering was to remove elements of trend so that the 

transformed series would be a white noise process. The plots 
in figures 1-3, which result from filtering common aggregate 
series by Sims’ method, show clearly that this method is not 
appropriate for these data. The conclusions suggested by the 
spectral plots are supported further by the white noise tests 


Figure 1 


reported in table 2. For each series, the test statistic rejects 
the null hypothesis that the chosen filtering technique pro- 
duced a white noise process. Causality tests based on these 
data could yield invalid test statistics. 


Figure 2 
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Figure 3 
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Table 2—Estimated values for the Fisher Kappa statistic 





Estimated Fisher 
Kappa’ 


64.31 
59.89 
30.76 

5.89 


Variable 











* All estimated values are to be compared with a critical 
value of 7.38. Critical values are provided by (5, p. 284). 


An Alternative Approach to Filtering 


One may search for an appropriate filter by changing the 
value of K in Sims’ transformation or by attempting similar 
ad hoc data manipulations. We will now present an alternative 
approach to filter selection and use one of the previous series 
to illustrate its practical application. Although this technique 
involves some preliminary data analysis, it can provide a 
reasonable guide to the identification of an appropriate data 
transformation. 


The suggested alternative to filtering uses the estimated co- 
efficients of an ARMA model to transform each data series. 
This approach requires identifying each series based on an 
analysis of its autocorrelation and partial autocorrelation 
functions, estimating the identified model, and finally, 
getting the model’s estimated residuals. If the fitted model is 
the appropriate time series representation of the data, its 
residuals will be white noise. However, identification of the 
most appropriate time series representation of a variabie is 
not always a simple task: “It is important to realize the speci- 
fication of an ARMA model is an art, rather than a science” 
(10). 


The residuals from each model can be tested against a null 
hypothesis of white noise in the same manner already 
described. Once a vector of white noise residuals is created 
for each series, the actual causality tests can be run by use of 
these residual vectors. The causality test will regress one vec- 
tor of residuals on past and future values of the other residual 
vector. A joint F-test on the significance of the coefficients 
associated with the future values will then indicate if, after 
proper filtering, one series still contributes to the explanation 
of the variation in the other. 


To illustrate, the GNP deflator (DEF) will be identified and 
then estimated by an ARMA model with the residuals from 
the estimated model subsequently tested for white noise. If 
we were to test for a causal relationship between DEF and 
another variable, the same steps would be followed, thereby 
creating a residual vector for that series. We could then per- 
form the actual causality tests by regressing the residual vec- 
tors on each other as described earlier. However, our purpose 
here is only to illustrate an alternative approach to data 
filtering. 


The first step involves the identification of an ARMA model 
which will represent the process that generates values for the 
series. Procedures described by Box and Jenkins and found in 
a variety of time series texts suggest that one can identify a 
model by analyzing the autocorrelation and partial correla- 
tion functions of the series. The plots of these functions 
indicate how many autoregressive and moving average terms 
to include in the model. For DEF, these plots suggest that a 
third-order autoregressive model will adequately represent 
the process which generates values for DEF.* That is, we 
have identified a model of the form: 


DEF; = a + 8, DEF,_} + 8gDEF;_» + B3DEF;_3 + & 


to represent the DEF series. 


With estimated values for the a and (; terms, a residual vec- 
tor can be created by simple manipulation: 


This estimated residual vector is the filtered series which 
would be used in a causality test. If the estimated ARMA 
model is the correct model for the DEF series, the e, vector 
should be white noise. 


To test the residuals as a white noise process, one can employ 
plots of the series’ spectral density and white noise tests. A 
flat spectral density indicates a white noise series because no 
particular frequency between 0 and I! radians makes a rela- 
tively larger contribution to the series’ variance than any 
other. This visual check can be supported by a white noise 
test provided by Fisher (see (5)). 


Figure 4 shows the spectral density for e,. Table 2 shows the 
Fisher Kappa statistics for all variables. The plot reveals no 
particular pattern in the relationship between frequency and 
the height of the spectral density. This flat spectrum would 
suggest that e, is a white noise process. This result is supported 
by a test statistic of 5.89, which is less than the critical value 
of 7.38 for 100 degrees of freedom.° From these test results, 
one can conclude that the residual vector from the estimated 
ARMA model is a white noise series. Thus, the e; vector is 
an appropriately filtered representation of DEF ‘hat could 
be used in a causality test. 





5 One should note that, although a third-order auto- 
regressive model was found to be appropriate for the DEF 
series, this same model may not appropriately represent 
other series. In fact, the DEF series could most likely be 
represented and transformed to white noise by something 
other than an AR(3) model, a possibility discussed in most 
time series texts. However, the point remains that filtering 
data by using the coefficients of an ARMA model requires 
that each time series be analyzed individually rather than be 
transformed by some common manipulation. ee 

* The Fisher Kappa does not follow a standard F distri- _ 
bution. Fuller provides the correct critical values against which 
the estimated test statistic should be compared (5, p. 284). 
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Figure 4 
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Conclusions 


One of the requirements of causality tests is that the data 
series be filtered to create white noise vectors from the 
original series values. Sims suggested a filter which he ap- 
parently believed would adequately transform most eco- 
nomic series—at least prior to 1972. We have demonstrated, 
however, that this filter does not properly filter a variety of 
common economic series, thus invalidating their use in caus- 
ality tests. 


A suggested alternative to filtering involves estimating an 
ARMA model. The residuals from the ARMA model are the 
filtered version of the data series. The spectral plot for the 
residual vector and tests for white noise can be used to 
indicate whether the results are, in fact, a properly filtered 
white noise process. Results based on the GNP deflator (DEF) 
indicate that Sims’ filter does not produce a white noise series 
but that the residuals from an ARMA model are white noise 
and can, therefore, be used correctly as one variable in a 
causality test. 
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Input Demand Formulations and Duality Theory 


By Thomas J. Lutton* 








Abstract 


It is difficult to obtain closed-form solutions for input demand equations that are conducive to 
econometric estimation and policy analysis when the production function is unknown. Develop- 
ments in the theory of duality facilitate obtaining such equations. This article provides three 
examples of dual approaches to neoclassical constrained optimization. One can obtain input 
demand equations and all comparative statics effects from the first and second derivatives of 


these dual functions. 
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Policy analysis of input demand frequently requires the 
systematic assessment and projection of the effects of an 
input price change on input demand, costs of production, 

and output. When comparative, static, competitive equilibrium 
assumptions are appropriate and when the typical producer’s 
production possibility set is known, neoclassical production 
theory provides the analyst with an analytical framework for 
evaluating such effects. 


Unfortunately, the policy analyst, unlike the producer, seldom 
has a quantitative a priori characterization of the production 
possibility set or of the production function that determines 
input demand. Assuming that the analyst chooses a neo- 
classical parametric approach to input demand formulation, 
he or she typically assumes some approximate functional 
form for technology, uses this form in a constrained optimi- 
zation, and derives the input demand equations from the first- 
and second-order conditions. The analyst attempts to use 
observations consisting of input and output quantity and 
price data directly or indirectly to estimate an input demand 
function. 


Advances in duality theory allow us to formulate a system of 
input demand functions which can be used to examine such 
issues as input price sensitivity, factor substitution, and out- 
put effects. Cost, indirect production, and profit functions 
are particularly useful for specifying and estimating input 
demand functions and for analyzing comparative statics. When 
the production function is unknown, these input demand 
functions are more easily derived with the dual approach 





*The author is an economist with the National Economics 
Division, ERS. He wishes to acknowledge and thank Anthony 
Prato and Lloyd Teigen, ERS, and Robert Chambers, Univer- 
sity of Maryland, for comments on an earlier draft. 


than with the traditional, constrained optimization ap- 
proaches. When the production function is unknown, the 
dual approach circumvents both the statistical problems 
associated with direct estimation of production functions 
and the computational difficulties in obtaining input de- 
mand functions for estimation. 


The analyst pursuing a neoclassical parametric approach has 
three choices: to econometrically estimate the assumed 
production function and use this functional form in a con- 
strained optimization to derive input demand schedules, to 
estimate the first-order conditions assuming some functional 
form for the marginal product of each input, or to estimate 
the reduced-form input demand system which explicitly in- 
cludes both first- and second-order conditions.’ 


In the first approach, one can estimate the parameters of 

the production function with single-equation estimators, 
ignoring factor prices. There is, however, a statistical problem 
with this approach. Such estimates of the parameters of the 
production function will be biased if inputs and outputs 

are jointly determined. Varian provides an example of bias 

in the direct estimation of a production function for corn 
(34). If the regressors (input quantities) in a production func- 
tion are correlated with the error term, ordinary-least-squares 
estimates will be biased. In addition, if the production func- 
tion is unknown and is approximated by a truncated expan- 
sion such as a Taylor series to avoid predetermining the elas- 
ticities of substitution, the number of parameters and agru- 
ments (squares, cross products, and so forth) in a single equa- 
tion for more than three inputs becomes extremely large, 





' Nonparametric approaches to input demand are con- 
tained in (1, 11, 16). Note: Italicized numbers in parentheses 
refer to items in the References at the end of this article. 
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complicating the estimation and possibly reducing the ana- 
lyst’s confidence in the parameter estimates.” 


The second option is to estimate the first-order conditions 
with factor prices as dependent variables and input quantities 
as exogenous variables. If the producer is assumed to be a 
price taker, the causality is reversed in the econometric model. 
Furthermore, if input quantities are jointly determined, 
multicollinearity problems may be severe. 


The third option is the preferred method for analyzing the 
systems effects of an input price change, data permitting and 
assumptions warranted. However, a difficult methodological 
problem arises with this approach. It is generally possible to 
obtain the reduced-form or closed-form solution only for the 
simpler functional forms of the production function. This 
difficulty may prevent formulation of the input demand 
equations derived from any second-order approximation to 
production functions which are more complex. Such flexible 
approximations are increasingly used for econometric esti- 
mation and for subsequent, comparative static analysis be- 
cause they avoid unnecessary restrictions on the elasticities 
of substitution. 


The theory of production duality has its roots in Hotelling 
(18), Hicks (17), Samuelson (30), Shephard (31), McFadden 
(23), Uzawa (33), and Diewert (9). The dual approach re- 
mained a didactic tool until Nerlove (27) used a cost function 
to derive and estimate reduced form input demand equations. 
Diewert (9), Fuss (12), Christensen, Jorgenson, and Lau (5,6), 
Denny (7), Norman and Russell (28), and Berndt and Kahled 


(3) contributed flexible functional forms for dual relationships. 


The emphasis here is on input demand formulation for em- 
pirical analysis. I derive input demands from three alternative 
objective functions for the competitive producer: output- 
constrained cost minimization, cost-constrained output 
maximization, and profit maximization. Each yields a de- 


rived demand formulation with different exogenous arguments. 


The producer is assumed to be a price taker in both input 
and output markets. Here I briefly review comparative statics 
using the neoclassical approach and compare this approach 
with dual formulations when the production function is 
unknown. 





? Consider a translogarithmic production function where 
output (Q) is expressed as a function of input quantities 
X = (Ky, Ma -. Hy): 
. ir 
InQ =a, + 2 Bink; °. 2 2B, ;In X,InX; 


ee 


where ag, B;, B,; are constants for i, j = 1, 2, ..., n. If By; = Bj 
and n = $, 24 coefficients must be estimated. 
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Neoclassical Comparative Statics 


As Mundlak (26) has demonstrated, it is possible to cast all 
three problems in a common framework. A competitive firm 
is assumed to have a twice differentiable production function: 


Q = F(X) (1) 


where Q is an output and X a vector of input quantities; 
R= (Kj, Ke, --9 Bn) 


The first order conditions for each of the three constrained 
optimization problems can be written as: 


F, (2) 


1 


where F; = 0F(X)/0X,, P; is the price of factor i, and A is a 
Lagrangian multiplier. Following Mundlak, the displacement 
from equilibrium for these three cases can be represented as: 


meas a | “ 


dX, dP, 
F, F,, Fio.-- dP, 
dX» 
Fo Fo Foo see : 




















P Fas Pen >s- 
[az | 
dP, 
dP, 














| aPa| (3) 


where h= d log \. The three cases differ in the measurement 
of dZ, the value of the constraint at equilibrium. These values 
are respectively: 


(a) Cost minimization 


1 
(dZ = x dQ = 0) (4a) 


(b) Output maximization (dZ = dC - 2X;dP; 








(c) Profit maximization Or = (0, 


dZ =~ (1/K,,)[Ko;][4P]) 
(4c) 


Utilizing equations (3) and (4), Mundlak shows that the sub- 
stitution effect in output-constrained cost minimization is 
identical to that in cost-constrained output maximization 
and profit maximization. Similarly, cost-constrained output 
maximization yields an output effect which is identical to 
that in a profit-maximizing adjustment. The distinguishing 
feature among the derived demand formulations is what re- 
mains fixed: output (Q), expenditures (C), or output price 
(PQ). Using Mundlak’s notation: 


Substitution Effect (dQ = 0): 


rs) X; 


OF} 1dQ=0 


Substitution and Output Effect (dC = 0): 


1 


OX; ac OX; 


ap. Uap ac (0) 
OP; | dQ=0 OP; |aq=0 9C Jap=o 


Substitution, Output, and Maximizing Effect (dPo = 0): 


OX; ac OX; 
Ki *%j- 


= = J ai 
OP; | ai=0 dP; | aq=o0 9 | ap=o0 





The desirability of the closed-form solution is apparent, 


because Mundlak has demonstrated that the substitution, out- 


put, and maximizing effects of an input price change may be 
obtained with partial derivatives of the closed-form solu- 
tion, thus making inversion of the bordered Hessian unnec- 
essary. Except for restrictive functional forms, the input 
demand formulation is difficult when the researcher starts 
with a neoclassical approach as described by Mundlak. 


Cost Function: A Dual Approach 


Fortunately, one need not specify the production function 
to derive the input demands. Consider the cost-minimizing 
input demand. Using the duality principles of Shephard 
(31) and Samuelson (30), Diewert (9) demonstrated how 
to obtain the cost-minimizing input demand by postulating 


the existence of a generalized cost function, dual to F(X), 
and by employing Shephard’s lemma, how to derive the 
optimal input demand, x? The cost function C(Q,P,), 
where costs are a function of output and input prices, is 
defined by: 


Min C = C(Q, P) = Min (EP,X; | F(X) > Q and X, P>0) 


= PX; (Q, P) (6) 
where X7(Q, P) is the closed form solution for input i. 


Shephard, Samuelson, and McFadden have established the 
duality between cost and production functions. Given one 

of these functions, together with the usual convexity, homo- 
geneity, and differentiability assumptions made with respect 
to it, the other function is uniquely determined (32). Fora cost 
function C(Q, P) to be dual to a production function, it must 
satisfy the following properties: 


Positive for all Q and P, 
Nondecreasing in Q and P, 
Linearly homogeneous in P, and 
At least quasi-concave in P. 


The production function dual to the above cost function will 
be of the classical type: nonnegative for nonnegative X, non- 
decreasing in X, and at least quasi-concave in input quantities. 


An important advantage of the dual approach to production 
analysis is that it often affords a simpler analytical procedure 
than the traditional approach. If a well-behaved C(Q, P) 
exists for a producer, Shepherd’s lemma states that the out- 
put-constrained input demand for the ith input is equal to 
the partial derivative of C(Q, P) with respect to P;: 


x 9C(Q, P) 
Xj = 
oP: 


Moreover: 


aX; 


82C (Q, Py, ..-, P,) 
ot: - gn Kjj 
An example of a Cobb-Douglas type cost function with its 
dual production function is provided. Write the cost func- 
tion as: 





c* and X." to designate the 
iated with the cost-minimizing, 


3We use the notation x", 
optimal input quantities assoc 
output-maximizing, and profit-maximizing objective func- 
tions, respectively. We use different notations for each input 
demand to emphasize that the comparative statics effects of 
an input price change are generally different for each. 





C(Q, P) = b,P?1P82Q (8) 


where b,,, bj, and bg are positive constants. C(Q, P) is thus 
assumed to be increasing in P;, Pp, and Q. If we assume con- 
stant returns to scale in production, b; + bg = 1. The optimal 
demand for X; would then be expressed as: 
-1 b 

X} = byboP}? P2?Q (9) 
A Cobb-Douglas production function dual to the Cobb- 
Douglas cost function (8) may be written: 


bj]? b3°2 xP xb2 





(10) 
by 


The constrained Lagrangian cost-minimizing input quantity 
Xo, given equation (10), is identical to that in (9): 


Xt = b,b,Pe1” | pe2Q (11) 


Indirect Production Function: 
A Dual Approach 


Econometric estimates of input demands derived from the 
cost function have been applied to agriculture by Duncan 
and Webb (10), Mensah and Miranowski (23), and Reisner 
and LeBlanc (22). The cost function is appropriate for inves- 
tigating the factor-substitution effects of an input price 
change that is conditional upon output level. If, however, the 
producer is an output maximizer with a budget constraint, 
the cost function is inappropriate for derived demand formu- 
lation. Moreover, estimation of equation (7) will produce 
biased parameter estimates, as output is likely to be correlated 
with the error term. The objective of the analyst may be to 
evaluate the expected effect on output of an increase in an 
input price as well as the possible decrease in the input per 
unit of output. The substitution and output effects in equa- 
tion (5b) may be difficult to obtain in the constrained 
Lagrangian framework if F(X) is sufficiently flexible and if 
inversion of the bordered Hessian is required. 


However, one can obtain the combined substitution and out- 
put effects by using the duality relationships without taking 
the inverse of a matrix. Hanoch (15) shows that it is generally 
possible to postulate the existence of an indirect or polar 
production function dual to a given production function. 


Let us define the indirect production function as: 


n _ 
Q= Q(C, P) = Max F(X) s.t. © PX; =C (12) 
i=j 


where X7 is the optimal input quantity maximizing Q and 
satisfying the budget constraint. Q(C, P) is referred to as the 
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indirect production function in which output is a function of 
input prices and costs of production. In the same fashion that 
C(Q, P) may be uniquely derived from F(X) in output-con- 
strained cost minimization, Q(C, P) may be uniquely derived 
from F(X) in cost-constrained output maximization. For this 
derivation to be possible, the function Q(C, P) must possess 
the following properties: 


@ Decreasing in P and increasing in C, 
® Quasi-convex in P, and 
® Homogeneous of degree zero in P and C. 


It is possible to derive the optimal demand for an input (using 
Roy’s lemma) as: 





(13) 


oe 8Q(C, P) / IQ(C, P 
Xf = KP, c)-- in 


ap, ac 


where x? is the optimal input demand, given the cost-con- 
strained, output-maximization objective function. For any 
production function F(X), Hanoch proves the unique exis- 
tence of the indirect production function (Q(C, Px). Simi- 
larly, given a well-behaved indirect production function, it is 
possible to show that a unique production function F(X) 
exists. Thus, it is possible to specify Q(C, Py) and estimate 
input demands directly from the indirect production func- 
tion. Note that X; is defined in terms of cost and input 
prices, not in terms of output quantity and input prices. 
Evaluating the total substitution and output effect of an 
input price change yields: 

ax? aX? (C, P) 
— (14) 


Thus, it is possible to anticipate the change in the demand 
for an input i as the derivative of the demand function 

Xx; (C, Px). It is preferable to employ Q(C, P) rather than 
F(X) when one derives equation (13) and observes changes in 


input demand under cost-constrained output maximization if 
the form of the production function F(X) is unknown. 


To separate the substitution and output effects in equation 
(14), observe that if the form of Q(C, P) is known, it is 
possible then to write C = C(Q, P). Using Mundlak’s results, 
one can show that: 
_ 020(P, Q) 
ac-0  9Pj2 fag=o 
dC(P,Q) dQ(P, C) 
OP, =}aqg=0 9F ~ Jac=o0 








where the substitution and output effects, respectively, 
appear to the right in equation (15). Thus, one can observe 
the substitution and output effects without matrix inversion 
and with greater computational facility. 


Note that, if output is endogenous and expenditures are 
exogenous, there is no contradiction of underlying econo- 
metric assumptions when one derives the cost function from 
the indirect production function to obtain a substitution 
effect. However, if a cost function is initially estimated, 
assuming that output is exogenous, there is a contradiction 

if one attempts to do the reverse—that is, to derive the in- 
direct production function from the cost function to calculate 
the output effect. 


Consider the problem of output maximization subject to a 
cost or budget constraint with a two-input Cobb-Douglas 
production function (10). The first-order conditions may be 
solved algebraically to yield: 


>* r 
X5 = boCP5! (16) 


Identical input demands may be obtained from the indirect 
production function Q(P x, C). For the indirect production 
function: 


Q= C(b PP? P3?)-1 (17) 


and the input demands are: 
X] = b, CP]! = -(8Q/aP,)(aQ/aC)~! 


X3 = boCP5! = -(8Q/APy)(Q/8C)~! 


The Profit Function: A Dual Approach 


Fuss and McFadden (13), Lau (21), and Normal and Russell 
(28) have shown that it is possible to postulate the existence 
of a profit function dual to a production function not charac- 
terized by constant returns to scale (for example, F(X) in 
equation (10)), so that the locus of maximum profits for all 
Po and P may be written: 


II(PQ, P) = Max PoF(X) - C(Q, P) (18) 
X 


where II(PQ, P) is the profit function. Profits become a func- 
tion of output price and input prices. The profit function pos- 
sesses the following properties: 


© Positively linearly homogeneous in Py and Po, 
@ Quasi convex in P, and 
@ Decreasing in P, and increasing in Po: 


For any production function F(X) with these properties, a 
unique profit function exists. Similarly, given any profit 
function with the above properties, a unique production 
function can be shown to exist. 


The salient feature of the profit function used in formulat- 
ing the input demand function is that the optimal input 
demands can be obtained as the first derivatives of the 
profit function by use of Hotelling’s lemma (18): 


- dIl(PQ,P) 


= X* (Po, P 
ap. chins ti 


where R*(PQ, P) is the input demand function. 


Moreover, by the same lemma, the supply function is given by: 


= Q(PQ, P) (19b) 


OPo 


where Q(Pa, P) is the product supply function. Using equa- 
tion (19a), we may write: 


ax; d2I1(PQ, P) 
aR aP 


(20) 


am lap. « 
j dPo=0 


where (2X; /9P)) |dPQ = 0 embodies the substitution, output, 
and maximizing effects of an input price change. The second 
derivative replaces the constrained optimization matrix 
inversion. 


The separate effects of the input price change may be ob- 
tained by transformation similar to those used in transforming 
the indirect production function into a cost function to obtain 


substitution and output effects. The optimal profit level is 
defined as: 


I1(Pg, P) = Pg Q(Pq, P) - ZP;X] (Pa, P) (21) 
where Q(Pa, P) is derived from the supply function (19b). 
Solving implicitly for PQ: 

Pa = H(Q, P) (22) 


Substituting this expression into the input demand equation, 
we then obtain: 





x* = x*(H(Q, P), P) = X3(Q, P) (23) 


Equation (23) can be used to construct the cost function dual 
to the profit function at the profit-maximizing output level Q. 


C = DP.X*(Q, P) = C(Q, P) (24) 


Thus, the analyst can derive a cost function at output Q which 
can be used to obtain the substitution effect where dQ = 0. 


dC(Q,P 
ee! x. (25) 


aPjaP; ’ 


The analyst can also use C(Q, P) to obtain the indirect pro- 
duction function Q(C, P) and to obtain the output effect 

as in equation (15). The maximizing effect may be obtained 
by use of these transformations and equation (5c). 


Thus, one can obtain the total effect and partial effects with- 
out inversion of a bordered Hessian matrix by exploiting the 
dual relationship among the profit, indirect production, and 
cost functions. Moreover, Mundlak’s results can be used to 
decompose the effect for a partial equilibrium policy analysis. 
It is conceptually possible to obtain such effects for each 
variable input in production, with assurance of consistent 
aggregation. This process permits obtaining factor substitu- 
tion and output adjustments from a well-defined profit 
function, supply function, and system of input demand 
functions. 


Consider the problem of output maximization under a de- 
creasing returns-to-scale Cobb-Douglas production function, 
subject to a fixed output price. The objective function 
becomes: 


X 


Substituting equation (10) into (26), we derive: 


P b]?1b5°2 
Max II(Pg, P) = ———— 
X b 


Oo 


X71 XB2 - P)X, - PoXo 
(27) 
where b; + by < 1. 


The profit function dual to F(X) in (26) is written: 
bg,51,): 
I(PQ, P) = boPo*P, Pa” (28) 


Solving equation (27) for xi yields the same result as taking 
the derivative of the profit function with respect to input 
prices: 
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dTI(PQ, P) 

Z be,b1-1,b 

4° eg Py” 
1 


The profit-maximizing input demands may be obtained 
through either a constrained optimization or a dual approach. 
However, the dual approach is analytically easier to imple- 
ment when the production function is unknown and there are 
more than two inputs in the production process. 


Functional Forms 


Given the equivalence of the comparative statics effects and 
the nature of the dual technological characterizations, it is 
apparent that the analyst can proceed to derive input demand 
equations from cost, indirect production, or profit functions, 
depending on the assumed producer’s objective function. 

The choice of specific functional forms for estimating and 
deriving input demand equations inevitably involves com- 
promises. Fuss, McFadden, and Mundlak (J 4) list the fol- 
lowing five criteria to help distinguish among forms: 


. Parsimony in parameters, 

. Ease of interpretation, 

. Computational ease, 

. Interpolative robustness within sample (concavity, 
monotonicity, and so forth), and 

. Extrapolative robustness outside sample (concavity, 
montonicity, and so forth). 


Among the more common functional forms providing a 
second-order approximation to any twice differentiable func- 
tion are the following: 


Generalized Leontief (8), 
Translog (5), 

Generalized Cobb-Douglas (9), 
Quadratic (20), 

Generalized quadratic (28), 
Generalized concave (14), and 
Generalized Box-Cox (3). 


The selection of a particular functional form for input demand 
analysis depends on the objectives of the analyst. These flex- 
ible functional forms often exhibit weakness in criteria 1, 3, 
and 5, The analyst must choose the form which best meets 
the objectives. For example, if the analyst wants to obtain 
input demand equations in a particular form such as input/ 
output ratios, a generalized Leontief functional form may be 
more useful than the translog form. If technological augmen- 
tation is the primary objective, a translog may be superior. 
The generalized Box-Cox form serves as a parent function for 
both the generalized Leontief and the translog, but is exceed- 
ingly difficult to formulate for either the indirect production 
or profit functions. Nevertheless, it is most useful for testing 
maintained hypotheses concerning the degree of production 








flexibility for cost functions. No one form serves all analytical 
purposes. 


Applications of dual cost or profit functions intended to 
examine input demand include Norman and Russell (28), 
Berndt and Wood (4), Fuss and McFadden (13), and Christen- 
sen, Jorgenson, and Lau (5). Few applications of indirect 
production functions are available, but indirect utility func- 
tions are used by Houthakker (19) and by Christensen, 
Jorgenson, and Lau (5). 


Conclusion 


The use of cost, indirect production, and profit functions 
simplifies the discussion of changes in input demand because 
it implicitly embodies both the objective function and the 
production function. This approach requires few constraints 
on the form of the underlying technology from which the 
demand for inputs may be derived. These functions are par- 
ticularly useful for estimating the effects of input price 
changes on input demand. Duality principles may be extended 
to problems of analyzing variable input demand with fixed 
inputs (28), technological change (3), uncertainty (29), and 
intertemporal decisions (2, 24). 


Although the discussion of input demand is couched in a 
steady state equilibrium framework, Fuss and McFadden (13) 
and Berndt (2) have explored theoretical and empirical exten- 
sions of duality principles to evaluate production flexibility 
and disequilibrium considerations for analyzing input de- 
mand under input price changes. These approaches are par- 
ticularly useful for many problems in which the production 
functions are unknown and steady state equilibrium assump- 
tions may be questionable. 


The duality approach still requires the researcher to make 
decisions concerning the variables under the control of the 
decisionmaker, the producer’s objective function, and the 
appropriate characterization of technology from which in- 
put demand is derived. Although the duality approach pro- 
vides a computationally more attractive alternative to input 
demand analysis, it requires similar assumptions concerning 
the nature of the technological characterization and the 
objective function as well as a comprehensive data base. 
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Estimating Input Cost Shares for Agriculture 
Using a Multinomial Logit Framework 


By Michael LeBlanc* 








Abstract 


Many econometric analyses include dependent variables constrained to the interval between zero 
and 1. Under such conditions, simple regression procedures break down. Several alternative stochas- 
tic models which avoid this problem can be defined depending on the assumed error structure. Two 
alternative forms of the logit model are treated here. The multivariate logit approach assumes that 
the share specification is an accurate representation of the underlying input demand structure. The 
multinomial logit approach treats the dependent variable as a probability with a multinomial 


density. 
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Many econometric analyses include dependent variables which 
are constrained to the interval between zero and 1. Typical of 
this type of analysis is the simultaneous estimation of input 
cost shares. Development of the translogarithmic (translog) 
cost function has increased interest in estimating systems of 
input share equations (10, 11). Cost functions and underlying 
share equation systems have been estimated by Christensen and 
Green (9), Berndt and Wood (6), Denny and Pinto (1/2), and 
Humphrey (/8).' From share equations it is possible to derive 
input price and substitution elasticities (5,20, 25). 


However, there is no implicit or explicit mechanism constrain- 
ing the prediction of input shares to between zero and 1 by 
use of simple regression procedures. Predictions may fall out- 
side the zero-1 interval and, because of the grouped nature of 
the data, error terms are likely to be heteroscedastic. The 
objective here is to outline two versions of a logit model which 
explicitly force predicted input shares to sum to 1. The logit 
is a sensible and convenient alternative to the limited depen- 
dent variable problem encountered when one estimates input 
share equations derived from a translog cost function. 


First, I describe the underlying structure of the logit model. I 
present two alternative forms of the logit model and discuss 
an estimation methodology for each. Finally, I cite an ex- 
ample of a multinomial logit model where a maximum likeli- 
hood technique is used to estimate input cost shares for 
agriculture. 





*The author is an economist with the National Economics 
Division, ERS. 

' Italicized numbers in parentheses refer to items in the 
References at the end of this article. 


The Logit Model 


The logit model is one approach to the analysis of a general 
class of problems termed discrete choice behavior. These 
types of problems have been investigated in other fields for 
many years (2, 3). Economists have begun to analyze prob- 
lems which could be cast in this framework by considering 
discrete choices as selections from a continuum of alterna- 
tives, thereby integrating these problems into the theory of 
the household and firm. 


Statistical economic analysis of the general population or the 
choice behavior of the average firm or consumer is compli- 
cated because such behavior must be described in probabil- 
istic terms. The probability of a particular choice is condi- 
tional on the explanatory variables selected for the analysis. 
A discrete choice framework has been used to examine many 
problems including migration (28), occupational choice 

(7, 30), demand for housing (35), and demand for consumer 
durables (16, 33). 


Interest in the logit derives from its relatively simple struc- 
tural form (29). The logit forces estimates of the dependent vari- 
ables to be between zero and 1 while summing to unity. It 
allows for a wide array of functional forms, although cer- 
tain functions are more easily applied. When applied in con- 
sumption analyses, the logit allows for nonunitary income 
elasticities, and when applied in production analyses, it does 
not place a priori restrictions on elasticities of substitution. 
Recent applications of the multinomial logit analyze the ex- 
penditures of firms on inputs (20) and of consumers on 
household goods (34). 
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The basic structure of the logit expenditure system is written 
as: 


(i=1,2,...k) (1) 


k 
Sj ~ exp(ei(x))/ 2 exp(ej(x)) 


where §; is the share or proportion of total expenditures spent 
on the ith good, x is a vector of explanatory variables such as 
prices or income, and ¢; is any mathematically well-behaved 
function linking the explanatory variables and the dependent 
share variables. 


Two alternative forms of the logit transformation can be 

used for empirical analysis. The multinomial form of logit, or 
conditional logit (27), treats the dependent variable as a prob- 
ability with a multinomial density. The multivariate logit 
emerges from a more ad hoc prediction-oriented approach 
which assumes that equation (1) is an accurate functional 
characterization of expenditures. With either form, one can 
incorporate many factors which influence expenditures, 

such as family or firm size, as explanatory variables while 
maintaining a theoretically consistent expenditure system. 


Multinomial Logit 


The theoretical foundation of the multinomial logit sets the 
decisionmaker in an environment where discrete choices of 
expenditures are made. If it is assumed that the larger the 
value of an index the greater the probability that the event 
will occur, then one can define a monotonic relationship link- 
ing the value of the index and the probability of the event’s 
occurrence, 


The decisionmaker’s choices are described in a decision index 
of the form: 


c=c(x) +e (2) 


where e is a random disturbance associated with a given prob- 
ability distribution and c(x) is nonstochastic. Faced with k 
alternatives, the decisionmaker will choose alternative i only 
if c;(x) +e; > c;(x) + e; for all } #i. The probability of this 
event occurring is: 


P; = Prob[c;(x) + e; > ¢;(x) + e;)] forallj#i (3) 


and, therefore: 


P; = Prob[e; - ej; > c;(x) ~ ¢;(x)] forallj#i (4) 


One must make binary comparisons of c;(x) with each alter- 
native c;(x). Probability distributions which are closed under 
subtraction, or produce convenient distributions when sub- 
tracted, are particularly attractive candidates for the prob- 
ability densities of e (19). 
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If F(e 1, €9, ..-, €,) represents the cumulative distribution 
function of the disturbances e; and if F; denotes the marginal 
density function of e; (the derivative of F with respect to ej), 
then the probability P; is written: 


Pi “| Fi[(e1 + 1 (x) ~ C(x), --- &y + K(X) 
(5) 


- c;(x) de; 


If the errors are independent and identically distributed, then 
a necessary and sufficient condition for the model described 
by equation (4) to yield the conditional logit form is that the 
errors have a Weibull density (27). That is: 


e; ~ exp (-exp(-e;)) (6) 


The Weibull density is a convenient way to generate the 
logistic distribution. It is closed under subtraction and closely 
approximates the normal distribution while being numerically 
simpler. McFadden (27) indicates that the underlying choice 
structure implies the independence of irrelevent alternatives 
axiom. The independence of irrelevant alternatives condition 
is both a strength and a weakness of the logit expenditure 
model (/4). It is a strength because introducing additional 
alternatives does not alter the relative odds with which pre- 
vious alternatives are selected. It is a weakness because it 
requires that the cross-elasticity of demand for each old ex- 
penditure category, with respect to an attribute of a new 
category, is uniform across all old categories. 


If the multinomial form of the logit is used to describe 
producer expenditures on inputs, then a set of k independent 
conditional probabilities are assumed to jointly determine the 
allocation of expenditures into k input categories. The prob- 
abilities have a logistic structure and are conditional on input 
prices and other explanatory variables. The probabilities are 
written as: 


k 


P= ee 2 er (¢;(x)) (i=1,2,...,k) (7) 
j= 


where P. is the conditional probability of $1 being allocated to 
input i, x is a vector of explanatory variables, and c;(x) is a 
decision index. 


Because the probability P,; is unobservable, the model is made 
operational by use of the share of the cost of production as- 
sociated with input i as a proxy (31). Equation (7) is there- 
fore rewritten as: 

k 

S; = v;q;/M = exp (e())/ 2 exp (c;(x)) 
j= 
(8) 
(i= 1, 2, ..., k) 








where M is the sum of each input (q;) used in production 
multiplied by its input price (v;). 


Before the multinomial logit input expenditure model can be 
estimated, a functional form must be selected for the decision 
index c;(x). The function may include input prices, output, 
and all other production or producer characteristics consid- 
ered relevant. Although no specific restrictions on c;(x) are 
necessary to estimate the multinomial logit model, estimation 
is easier if the functions are assumed to be linear in their 
parameters and invariant in structure between equations 
except for interequation parameter variability. The functions 
are written as: 


Gi=1,2,..,.k}) (©) 


n 
= » B.-h;:(x) 
ja. i 


where Bij are parameters and hij are functions. 


The expenditure of $1 on a given input is analogous to sam- 
pling with replacement from a population classified into k 
categories. The resulting multinomial distribution is written 


(10) 


where E; equals v;q; and is non-negative. For a sample of T 
observations, the logarithm of the likelihood function as- 
sociated with equation (8) is written as: 


T /k k 
InL = Constant + 2 ( L Sipcie - In ( E exp(cit))) (11) 
t=1 ‘i=1 i=1 


Although the functions c;(x) may take any form, functions 
which are linear in parameters are sufficiently flexible for 
most purposes. A linear form also leads to simple expressions 
for the maximization of the likelihood function. Therefore, 
if: 


ae 


then, the first-order conditions for the maximization of the 
log-likelihood function are: 


x 
dInL/OB,; = 2 xitlSit -P.J-°0 (i=1,2,...,k) (13) 
(j= 1, 2, ..., n) 


The second-order conditions for the maximization of the log- 
likelihood function are: 


T 
d2InL 0B; 0B; - Z rieePie Pit - H< 9 


(i 


(14) 


where both parameters (denoted by j and j’) are in the ith 
equation and: 


d7InL/AB;j0B; - ge tPi't <0 (i > E 2, ooeg b) 


i ee 


where each parameter is from a different equation. 


The maximum likelihood estimators are invariant to mono- 
tonic transformations of the likelihood function. The log- 
likelihood function is maximized where the first-order de- 
rivatives are zero (equation (13)) and the matrix of second- 
order derivatives, formed by the derivatives in equations 
(14) and (15), is negative definite. Because the first and 
second derivatives are nonlinear in parameters, an iterative 
search procedure is needed to solve for the parameters Bjj- 


The estimation procedure is complicated by an indeter- 
minancy in the equation system which arises because the 
sum of the shares must equal 1. The share equations are, 
therefore, invariant with the addition of the same expression, 
InZ for example, to each decision index. The indeterminancy 
causes the matrix of second-order partial derivatives of the 
log-likelihood function to be singular. 


One can avoid the singularity problem by normalizing the 
parameters for a particular variable in the k functions (34). 
The normalization does not affect the predicted shares. A 
straightforward approach is to divide the k - 1 equations by 
the kth equation. In a logarithmic form, the share system is: 


n 
j=l 


Maximizing InL with respect to c; is equivalent to maximizing 
InL with respect to c; - c, for any k (34). It is only necessary 
to consider k - 1 equations as the information provided by 
the kth equation is constant. Furthermore, regardless of the 
normalization employed, the predicted values of S; are iden- 
tical. Therefore, equation (16) can be used to estimate input 
cost shares by use of linear regression. One can compute the 
individual input cost shares, S;, from the regression results 
while forcing the shares to sum to unity. 





Multivariate Logit 


An alternative stochastic form of the logit uses the share 
system given in equation (8) directly. The share equations are 
interpreted as an accurate characterization of producer expen- 
ditures for inputs (8, 22). One can generate the stochastic model 
structure by appending error terms e; to each share equation. 


Each equation’s error term represents deviations between 
optimal cost shares and observed cost-minimizing shares. 
There are several reasons for the existence of the error term: 
the failure of input markets to clear perfectly, the aggrega- 
tion or measurement error, or the randomness of human 
behavior. The error term associated with the ith cost share 
is assumed to be distributed normally with mean zero and 
variance 0;. The variance is not assumed to be constant 
across shares because the variance of e; is generally not 
equal to the variance of e;. 


Three covariances are generated by the error terms in the 
share equations. One of these is the covariance between 
disturbances of different observations and of the same share 
equation: 


0;; = E(ejt, ejt') (t Ht’ = 1, 2, ..., T) (i= 1, 2, -.., k) (17) 


The second represents the relationship between different share 
equations and observations: 


04; = E(eit, ejy')(t Ft’ = 1, 2, ..., THiF#j=1, 2, ..., k) (18) 
Both these covariances are assumed to equal zero. 


The third covariance arises from the combination of different 
share equations and the same observation: 


05;= E(ei¢, ej) (t= 1, 2, ...,T) (i #j=1, 2,..., k) (19) 
This covariance is usually referred to as the contemporaneous 
covariance (32). Because the underlying production structure 
is estimated as a system, it is unlikely that the contemporan- 
eous covariances are zero. By appropriately stacking the share 
equations, one can write the variance-covariance matrix for 
the disturbance term as a block-diagonal matrix with T diag- 
onal submatrices. 


This matrix represents the interdependency of the k share 
equations for each observation (t = 1, 2, ..., T). The off- 
diagonal submatrices of the error system’s variance-covariance 
matrix are zero by assumption. 


The specified system of share equations is characterized as a 
seemingly unrelated regression problem (41). Zellner’s gen- 
eralized least-squares procedure cannot be directly applied, 
however. Because the disturbances of the share equation must 
sum to zero, the estimated variance-covariance matrix nec- 
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essary for implementing Zellner’s procedure is singular. One 
can transform the share equations to an estimable form by 
normalizing with the kth share equation. 


Parameters estimated by the Zellner generalized least-squares 
procedure are not invariant to the choice of common denom- 
inator share when an estimated variance-covariance matrix is 
employed. However, maximum likelihood estimates are in- 
variant to which equation is deleted (1), and iterating the 
Zellner procedure leads to maximum likelihood estimates 
(13, 21). Therefore, iterating the Zellner procedure is a com- 
putationally efficient means of deriving parameter estimates. 
In general, the properties of maximum likelihood estimators 
only hold asymptotically. However, most of the maximum 
likelihood estimators’ asymptotic properties are present in 
small samples (21). 


If the same set of regressors is utilized in all k - 1 equations, 
then the iterative Zellner procedure and ordinary least squares 
give identical parameter estimates. However, estimates of the 
standard errors may differ. 


An Example of a Multinomial Logit Estimation 


A multinomial logit model is used to estimate a system of 
cost share equations for agricultural inputs. In addition, the 
price elasticity of demand for each input is calculated by use 
of the multinomial logit parameter estimates. Although the 
procedure allows one considerable flexibility in selecting a 
functional form and avoids the limited dependent variable 
problem, it is still subject to the same practical difficulties 
(for example, data aggregation) in applying all econometric 
models. 


Estimated Form 


The underlying model structure estimated in this example is: 


k k k 


S; = exp (a; + Pa! papel 2 0 (a; + ar? (20) 


where §; is the ith input share, a and B are unknown param- 
eters, and vj is the price of input j. 


Because the model specified in equation (20) leads to a 
singular matrix of second-order derivatives, a maximum likeli- 
hood procedure cannot be applied. Instead, one can trans- 
form the share system to an estimable form by normalizing 
on the kth equation. After taking logarithms, the estimated 
form of the share system is: 


= By, ;) Inv; 


k 
In(S;/S,.) = (a; - ay) + «Oy 
j= 


(i=4,2,.,k- 2) (28) 








The parameters a; and B; are now defined as differences from 
the parameters of the kth equation. 


Data and Estimating Methodology 


Production cost data were developed for land, labor, fertilizer, 
energy, and capital inputs for the United States. Cross-sec- 
tional data for 39 continental States for 1974 are utilized 

(15, 24, 38, 39, 40). 


Because the likelihood function associated with equation (21) 
has non-linear first and second derivatives, an interative search 
procedure is used to solve for the maximum likelihood esti- 
mates. Tyrrell (34) developed the computer software used in 
this analysis to analyze household expenditure patterns. 
Tyrrell employs both first and second derivatives in an ex- 
tended Newton-Raphson procedure (see (17, 26)). The final 
output includes a vector of estimated coefficients, first- and 
second-order derivatives of the likelihood function, asympto- 
tic standard errors, and asymptotic t-statistics. 


Model Results 


I estimated the expenditure system by normalizing on the 
capital cost share. Table 1 shows the estimated parameters 
and asymptotic standard errors of the model specified in 
equation (21). Because maximum likelihood estimates are 
asymptotically normally distributed and the standard errors 
are asymptotically distributed as a chi-square, the ratios of the 
estimated parameters to their standard errors can be inter- 
preted as asymptotic t-statistics. 


The t-statistics indicate general support for the estimated 
parameters. Of the 24 parameter estimates, 16 exceed 1.0; 
of these, 10 exceed 2.0. The t-statistics for the own-price 
parameters for each input are the highest. The land and cap- 
ital price parameter for each input are the lowest. The low- 
price land and capital parameters may be associated with 
poor measurement of these inputs. 


Because the share system is indeterminant, it is not possible 
to identify each B;; in the expenditure system. The estimated 


Table 1—Multinomial logit parameter estimates’ 


parameters can only be interpreted as differences from the 
parameter associated with the normalizing input cost share 
(see equation (23)). 


The estimated likelihood function can be used to formulate 
statistical tests (4). The test statistic, - 2InA, is asymptotically 
distributed as a chi-square with degrees of freedom equal to 
the number of independently imposed restrictions when: 


A= (INRl/|Qyl)- 7/2 (22) 


where |Qp| and |{2y)| are the determinants of the restricted 
and unrestricted estimated variance-covariance matrices of 
error terms (32). The chi-square test allows for comparisons 
of different models if the restricted model is nested within 
an unrestricted model. In this analysis, the chi-square test is 
used to test the null hypothesis that all coefficients equal 
zero. The null hypothesis is easily rejected at the 1-percent 
level with 30 degrees of freedom. 


Although the estimated share system is normalized on the kth 
share equation (capital), it is possible to solve for the pre- 
dicted shares for all inputs. The kth share can be computed 
because: 


. k-1 ae 
1G, +( = exp (In($;/S;,)$,)))= 1 (23) 


where S;, is the projected cost share for the kth input (capital) 
and In(S78,) is the estimated dependent variable associated 
with the ith equation. The estimated share for the kth input 

is computed as: 


‘ k-1 “\ 
S, =1/(1+ ., exp (In(S; /S;,.)) (24) 


i= 


Table 2 reports the predicted and observed cost shares. The 
model fails to predict actual cest shares with a high degree of 
accuracy. The average absolute differences between predicted 
and observed input cost shares are 0.05 (land), 0.07 (labor), 





Input price 





Input 


Intercept 


-7.420 
(2.450) 


Energy 


-0.390 
(.280) 


-.009 -1.320 1.384 .632 
(.141) (.358) (.346) (.827) 


121 576 .806 .031 
(.129) (.359) (.322) (.785) 


-.023 1.249 ~.496 ~ 569 
(.183) (.492) (.370) (1.092) 


Capital | 


-0.653 
(.726) 


Land Fertilizer | 


0.111 
(.033) 





Land 0.816 
(.117) 


-1.040 
(2.842) 


-8.330 
(2.700) 


-6.350 
(3.546) 


Labor 


Fertilizer 


Energy 








le : 
Standard errors in parentheses. 











SuIWoAM 
UISUODSIM 
UO TSUTYSEM 
BIUISIIA 


Yeti) 

se xo] 
aessouua J, 
ejoyeq yynog 
euljoieg yyNog 


erueva[Asuuag 
uosa1O 
Buloye[y{O 
o1uO 

ejOy¥eG YON 


BUI[OIeD YON 
yIOK MeN 
OoIxap] MAN 
eyseiqaNn 
euezwUOy 


LInossIfy 
tddisstsstjq] 
eyJOSOUUIPY 

uBstyoI 
91.0" ; ; . . . pueyAreyy 


9L0° ‘ ° ag . 4 4 vueIsIno’y 
seo : C : ’ F : . Ayonquay 
vol j Ff i ; ,” ‘ sesuey 
€S0° { 99T ; j ; ; BAO] 
190° : A . . . , eueipuy 


690° 
€0T 
L90° 
990° 
080° 


soul] 
oyep] 
BIs10a4) 
epHoly 
aIBMBIIG] 


oD 09 LO ET 0 
ree Ne 


896° . 661° 
SZT 69T j 990° 
99Z° 092° q 680° 
Lvl 0&S ‘ Sot 
0SEe°0 a oe 6L00 890°0 


oOpelojog 
> BIUIOJIILS 
L8Z sesueyiy 
O&Z euozZuy 
L320 eweqeiy 


LO f= ri f= 
DAr-AO matOro 


JUIAIAg 





paaiasqg | peIpaig peaiesqo | peypaig paasasqg | peqoIpaig paasasqo | payoIpeig peasssqg | payopaig 








jeqyideg Asi90uq JIZI[I4AI loge] pue'y 














synduy 








aje7¢ Aq ‘sareys 380d paaAlasqo puke pazdIIpaig—Z ajqe], 





0.05 (fertilizer), 0.02 (energy), and 0.03 (capital), whereas the 
average observed cost shares are 0.28 (land), 0.17 (labor), 
0.20 (fertilizer), 0.08 (energy), and 0.28 (capital). If the ab- 
solute differences are compared with the observed shares, 
then labor and energy are the least accurate predictions and 
capital is the most accurate. The bad predictive capability of 
the model is largely attributed to the use of aggregate cross- 
sectional data. The appropriate unit of observation is the firm. 
However, no reliable set of input price and quantity data are 
available at the firm level. This problem is particularly true 
for energy. 


In addition to predicting the individual cost shares, the multi- 
nomial logit model enables the analyst to derive the price 
elasticities implicit in the derived demand for each input. If it 
is assumed that total expenditure on inputs, M, is not invar- 
iant to changes in input prices, then the own-price and cross- 
price eleasticities are approximated by: 


(i= 1, 2, 
and by: 


a i S; = vj(fj Ov; - §}0f;/Av;) 


] 


+k) (26) 


The indeterminancy caused by the adding-up constraint pre- 
vents direct calculation of the input price elasticities. How- 
ever, the predicted price elasticities can be derived if the 
elasticities associated with the kth share are derived first. 
The predicted elasticity for the kth share with respect to 
the ith price is: 


. " k-1, 
By = Sm 4 S; (Bj, - Byj) +1 (ifi = k; or O if i * k) 
j=1 2 


it 2s ee a (an 


and, therefore: 


Table 3 reports the average predicted own-price and cross- 
price elasticities. All the own-price elasticities have the right 
sign except labor, which is close to zero. The magnitudes of 
the own-price elasticities are similar to a translog specifica- 
tion with the same data (24). They do, however, differ 
significantly from translog specifications estimated with time- 


Table 3—Average price elasticities of predicted input demand 





Input price 





Factor 


Land | Labor } Fertilizer | Energy | Capital 





0.116 1.051 -0.559 -0.623 
-1.524 1.072 519 
-.539 .583 .008 
~1.647 -.521 
-.152 ~.600 


Land -0.152 
Labor - 190 .006 
Fertilizer .099 135 
Energy .027 921 1.206 
Capital -~San “En -.115 








series data (23). This discrepancy suggests that the data and not 
the model specification are the source of the elasticity 
differences. 


Unlike a translog system, cross-price elasticities are not con- 
strained to be symmetric. In fact, the cross-price elasticities 
for land and labor and for land and energy do not have the 
same signs. The cross-price elasticities indicate that land is a 
substitute for labor and fertilizer in farm production, but is a 
complement with energy and capital (36). Controversy still ex- 
ists over the relationship of energy to other inputs in manufac- 
turing. The results here indicate that energy is a complement 
with capital and is a substitute with labor in agricultural 
production. Although other analyses confirm the results for 
capital, results for labor are ambiguous and depend on 
whether the model distinguishes between long- and shortrun 
input substitution (23). 


Summary 


The logit model provides a flexible alternative to the more 
popular translog approach to estimating systems of input 
share equations. The logit allows for a wider range of explana- 
tory variables and functional forms than does the translog. 
Furthermore, the logit constrains share predictions to the 
interval between zero and 1. Two forms of the logit (multi- 
variate and multinomial) can be defined depending on the 
assumed error structure. For input expenditure systems, 

the multivariate logit assumes the share specification accu- 
rately represents the underlying input demand structure. The 
multinomial logit treats the dependent variable as a prob- 
ability with a multinomial density. Either model can be 
estimated with well-known techniques and can provide 
estimates of predicted input cost shares and price elastici- 
ties of demand. However, because of the indeterminancy of 
the share system, individual parameter estimates can not be 
identified for the multinomial logit model. 


The logit model has been applied to many economic prob- 
lems. Within the context of agricultural production, addi- 
tional applications might include dynamic models or incor- 
poration of the cost share system into a macromodel where 
agricultural production is only one of many production 
sectors. Because the logit’s structure forces share systems 

to sum to unity, it is an attractive candidate for maintaining 
consistent predictions. 
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Major changes in the agricultural sector have prompted many 
observers to question the necessity and desirability of main- 
taining specific tax provisions which benefit farmers—for ex- 
ample, cash accounting; expensing capital costs for groves, 
orchards, and vineyards; and allowing farmers to treat gains 
on sales of qualifying livestock held for draft, breeding, dairy, 
or sporting purposes as capital gains. Many economists and 
tax experts believe that farmers no longer need the prefer- 
ential treatment contained in the Internal Revenue Code. 
Others claim that farming has not changed substantially and 
that farmers still need these provisions. Still others suggest 
that changes in the agricultural sector and higher land values 
have created new problems for farmers that warrant additional 
tax relief. 


Charles Sisson’s book investigates comparative farm and non- 
farm burdens. It outlines for tax researchers and policy- 
makers the fundamental issues of farm taxation. Sisson 
discusses some important issues of farm taxation and states 
several hypotheses. First, he suggests that the tax burden for 
the farm population is substantially lower than it is for the 
U.S. population as a whole. Second, he hypothesizes that 
this viewpoint is based on the fact that farmers receive prefer- 
ential treatment under Federal income tax laws. Third, he 
suggests that the difference between farm and nonfarm tax 
burdens is attributable to the extremely low tax burdens of 
high-income farmers. Finally, he questions whether Federal 
estate taxes impose a greater tax burden on farmers than on 
the total population so that these taxes should be considered 
in the overall analysis of comparative farm and nonfarm tax 
burdens. 


Sisson resolves definitional, methodological, and data prob- 
lems inherent in farm tax research. He presents his results 
and evaluates whether his analysis is valid without recognition 
of the effects of the estate tax. Finally, he recommends areas 
for tax reform. 


Potential tax liabilities can significantly affect investment 
decisions. Sisson demonstrates that preferentially taxed farm 
investments have higher after-tax present values than other 
investments. Thus, tax preferences may significantly affect 
investment decisions and possibly lead to distortions in the 
allocation of farm resources. Have investors shifted invest- 
ment in an effort to maximize their after-tax incomes? To 
what extent are tax preferences in the farm sector encour- 
aging nonfarm investors to enter agriculture? Sisson examines 
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the issue of tax-induced capital in agriculture and attempts to 
determine the significance of tax-loss farming. 


Two fundamental questions must be resolved before we ex- 
amine the farm-nonfarm tax burden issue. First, what is a 
farmer? Second, how do we measure tax burdens? ‘‘Farmer”’ 
is an inherently ambiguous concept. Citing the changing 
nature of the farm sector and the need for a set of farm defi- 
nitions that policymakers can use for multiple purposes, 
Sisson concludes that a single definition is no longer suf- 
ficient to identify the farm population. Sisson uses five defi- 
nitions of a farmer that are based on economic and socio- 
economic criteria. These include the Bureau of the Census 
and the Internal Revenue Service (IRS) definitions as well as 
those based on income and place of residence. Examining the 
farm-nonfarm tax burden issue for various farm definitions 
provides valuable information about the farm population and 
insight into the probable effects of tax reform. 


The method by which tax burdens should be measured is a 
somewhat more difficult issue. Sisson concludes that the 
only feasible alternative is the ability-to-pay approach mea- 
sured by the Haig-Simons concept of income, a concept 
which he fully develops in the book. Sisson uses a simple 

tax burden index consisting of tax liability divided by Haig- 
Simons income. The tax liability portion of that index was 
determined on the basis of apparent tax liabilities and of alter- 
native progressive and regressive tax incidence assumptions. 
Sisson’s progressive scenario generally assumes a tax is borne 
by property income whereas his regressive alternative assumes 
that consumers bear the tax. 


Sisson’s results depend on the data set and his assumptions. 
We have already mentioned many of his assumptions, but 
have not discussed his use of the Brookings MERGE synthetic 
data set. The MERGE file is a synthetic data set compiled 
from cross-matching 87,000 records from the IRS tax file 

for 1966 with 30,000 household records from the 1967 
Survey of Economic Opportunity (SEO). Ideally, combining 
these two data sets would provide a close approximation of 
the tax returns filed in calendar year 1966 by each of the 
30,000 households in the SEO data file. If matched properly, 
the more complete tax information provided by the IRS data 
combined with the demographic information included in the 
SEO data file would be a more powerful data base than either 
data file used alone. This file was described in detail in an 
earlier issue of this journal (1).’ In the MERGE file, less than 
2 percent of the SEO data file was not matched with IRS 
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data, and these records were generally concentrated in the 
highest income classes. Sisson evaluated his results, excluding 
the unmatched data in the demographic data set (limited to 
families with less than $80,000 in adjusted gross income) and 
including the unmatched data in the income data set. Sisson 
believed that the comparison of the results from the demo- 
graphic and income data sets could provide information on 
the tax treatment of various components of the farm popula- 
tion and on their distribution. 


Sisson compares the tax burden by adjusted family income 
of the total population to the tax burdens under the five 
alternative definitions of a farmer and under varying progres- 
sive and regressive tax-incidence assumptions. Except for 

the lowest income classes, farm families have a significantly 
lower tax burden than the total population irrespective of the 
definition of farm or the incidence assumption. Sisson con- 
cludes that the benefits of preferential tax treatment for 
farmers are fairly evenly distributed across the rural sector. 


Sisson compares GINI ratios based on adjusted family income 
before taxes to measure the inequality of the income distri- 
bution within the farm sector in comparison with the total 
population.* He concludes that the income distribution is 


much more evenly distributed in the total population. He 
next calculates GINI ratios based on after-tax adjusted family 
income and concludes that, because the GINI ratios fall for 
both the total population and the alternative definitions of a 
farmer, the tax system does improve distributional equality. 
Sisson also calculates GINI ratios based on after-tax income 
with the capital gains exclusion eliminated. He concludes 
that, as the GINI ratios are not significantly affected, the 
repeal of the capital gains provisions would do little to im- 
prove distributional equality. The benefits of the capital gains 
provisions are broadly used across all income classes within 
the farm sector. 


Sisson addresses the question of tax-loss farming by analyz- 
ing the distribution of farm losses of those individuals whom 
he considers more dependent on farming as a primary occu- 
pation than others who are only marginally involved farm- 
ing. Although he acknowledges the definitional problems en- 
countered in dividing these subgroups, he nonetheless makes 
two important conclusions. First, the great bulk of farm 
losses are incurred by individuals who depend heavily on 
farming for income. Second, “while tax-loss farming is not 
pervasive in agriculture, neither is it rare.” Therefore, he con- 
cludes that tax preferences for the agricultural sector provide 
a strong incentive for high-income taxpayers to invest in 
agriculture. 


Sisson attempts to determine whether the effects of the estate 
tax on farmers and nonfarmers warrant any changes. If estate 





? The GINI coefficient is an index used to measure inequal- 
ity in the distribution of income. The GINI coefficient ranges 
from zero (perfect equality) to 1 (absolute inequality). 


tax burdens in 1966 varied significantly between farm estates 
and nonfarm estates, then the comparative tax burden 
analysis would have to be modified. Until 1977, all estates 
faced the same estate tax laws. In 1976, however, the Con- 
gress enacted two special provisions for estate taxes that 
benefit farmers. Farmers used two basic arguments in 1976 
to convince the Congress that they needed preferential estate 
tax treatment. First, they argued that the estate tax was based 
largely on land values which were unrealistically high relative 
to the land’s income-generating ability. Second, they argued 
that farm liquidity was lower than that in other types of 
estates and that the estate tax was, therefore, hard to pay. 


Sisson suggests that if farmers needed this special assistance 
in 1976, then perhaps they needed it in 1966, the year on 
which he based his analysis. He attempts to determine 
whether or not the estate tax burden for farmers was higher 
than it was for nonfarmers in 1966. 


Using the MERGE file for 1966, Sisson compares liquidity- 
to-estate ratios to determine relative farm-nonfarm estate tax 
burdens. Whereas low liquidity was certainly a fundamental 
argument supporting the enactment of preferential estate tax 
provisions for the farm sector, it is not the appropriate mea- 
sure for determining relative estate tax burdens between farm 
and nonfarm sectors. Rather, Sisson seems to be measuring a 
ratio of ‘‘undue hardship”’ as manifested in insufficient or 
barely sufficient liquid assets available to pay the estate tax 
at the time the tax is imposed. 


Sisson concludes that the relative estate tax burden in 1966 
in the farm sector vis-a-vis the total population was not suf- 
ficiently different to warrant adjusting his original analysis to 
include the estate tax. Although his findings do not actually 
support this assertion, because of his failure to measure actual 
estate tax burdens, this conclusion may, nevertheless, have 
been true in 1966. 


In his final chapter, Sisson discusses the limitations of his 
analysis. At least two of these are significant enough to cast 
doubt on his conclusions. Our most serious concern is his 
failure to differentiate between personal income earned as 


‘wages or professional fees and personal income earned from 


the ownership of a business. The Internal Revenue Code is 
saturated with tax provisions designed to stimulate business 
investment and activities. Such provisions are generally not 
available to wage earners. Thus, farmers and other business 
owners would be expected to incur lower tax burdens than 
the total population, especially in the Federal tax area to 
which Sisson attributes a significant portion of the tax bur- 
den differential. Sisson does not compare the tax burden of 
farmers with the burden of other business owners. 


Another limitation is the relative inability of the MERGE 
data file to depict current economic conditions. Sisson con- 
cludes that ‘“‘the observations regarding farm-nonfarm tax 





burdens are nearly as valid today as they were for 1966.” 

He cites a statement by Joseph Pechman in 1975 regarding 
the continued validity of the 1966 MERGE file as support. 
If Sisson means the year 1975 when he uses the word “today,’ 
then his conclusion is less subject to question. However, if he 
is speaking of the early eighties, serious doubts arise both be- 
cause of extensive Federal, State, and local tax reform and 
because of changing economic conditions in the farm sector. 
We, therefore, must agree with the author’s statement that 
his tax differential estimates “must be regarded as upper 
bounds for the effects of these special tax rules rather than 
unbiased estimates of their amounts.” 


> 


Despite the limitations of Sisson’s study, it is useful to tax 
researchers primarily because of its contributions to the use 


of synthetic microdata files in tax research and its discussions 
on fundamental farm tax research issues. For researchers less 
familiar with farm tax issues but who have some interest in 
these issues, the book provides substantial insight into those 
areas which, in the past decade, have dominated research on 
farm taxes. 
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The introduction of modern cereal varieties (MVs) into rural 
Asian communities with rapid population growth produces 
economic and social change. The authors examine the thesis 
that MVs cause economic polarization; they also examine 
Ruttan’s thesis that altered resource availabilities induce 
institutional change (/).' The authors ultimately absolve 
MVs of blame for polarization. They find that changes in 
returns to resources are brought about by institutional 
change induced by failures in factor markets. Although the 
evidence for this conclusion is drawn principally from four 
village case studies, they argue that these findings hold 
widely in Asia (as the title suggests). 


To draw an Asian conclusion from four village case studies, 
Hayami and Kikuchi take an advocate’s approach. They 
thoroughly acquaint themselves not only with these villages 
but also with the body of evidence on rural institutional 
change in Asia. They informally, but rigorously, build their 
case. They do this in a thoroughly professional manner, 
treating the reader to a highly rewarding exploration of 
critical agricultural development issues and the down-to- 
the-village facts. 


In chapter 2, Hayami and Kikuchi describe their economic 
approach to examining institutional change, defined as 
“an attempt to analyze, by the logic of economics, the 
process by which institutional change was dictated by 
noneconomic as well as economic forces.”” Economic 
determinism is specifically disavowed. They argue that 
the distribution of income among various resource con- 
tributers (landlords, tenants, and laborers) responds to 
changes in resource endowments and technology through 
adjustments in institutions governing the use of land and 
labor. The following are some examples of the application 
of economic logic to these problems: 


A person is altruistic to the extent that the 
return to altruism exceeds the cost of behaving 
as an altruist. . . .villagers will violate the village 
institutions if gains exceed costs. 


If the [enterprising village leader] senses that 
the outcome will be more efficient if each 
member of the group pays a share of the mar- 
ginal cost of additional units of the collective 
good equal to his share of the benefits. . .and 
others do not sense this, the leader will be able 
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to suggest arrangements which can leave every- 
one in the group better off. 


In chapter 3, Hayami and Kikuchi give a macro and micro 
overview of agrarian change in Asia. They present the eco- 
nomics of income distribution among laborers and asset 
holders under increasing population and changing tech- 
nology. They map two Asian roads of change: (1) polariza- 
tion of peasant communities into commercial farmers and 
landless wage-workers, and (2) stratification of communities 
into a continuous spectrum of social arrangements for 
organizing labor and assets—from landless laborers and 
noncultivating landlords, to sharecropper, to peasant 
proprietors. With polarization, traditional patron-client 
mechanisms are replaced by impersonal market relations. 
With stratification, the personal mechanisms are sustained. 
The basic principle supporting the author’s economic 
approach is that scarce resources result in a tight community 
structure which fixes economic roles, whereas informal 
markets require informal social structures. 


The remainder of the text (200 pages) is devoted to examin- 
ing whether polarization or stratification predominates 

and, on finding that stratification does, to exploring whether 
peasant polarization can be forestalled under increasing 
population pressure. The conclusion is that polarization can 
be avoided if efforts to generate scale-neutral technological 
progress (such as MVs) continue. 


The polarization/stratification conclusions are based on a 
review of evolving agrarian institutions in Luzon (the Phil- 
ippines), a survey of rice harvesting methods in 100 villages, 
and two village case studies. The authors base their con- 
clusion on prospects for continued adjustment to popula- 
tion pressure through peasant stratification on a review of 
agrarian change in Java, a survey of rice harvesting methods 
in 48 villages, and two village case studies (one technolog- 
ically stagnant and the other technologically progressive). 


This book is a major step forward. It brings the insights 

of various disciplines to bear on the forces governing village 
change. The authors, both agricultural economists, enter 
the terrain of other disciplines and extend their own pre- 
cepts into new realms. There is always a risk that in doing 
so one will lose or misrepresent the contribution of others, 
building up rather than tearing down barriers to understand- 
ing. It is clear that they took great care to avoid this, with 
solid success. They demonstrate a wealth of understanding 
of village society and institutions, and they clearly demon- 
strate how provision is made for resource organization and 
management. 
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Through their research approach and their demonstrated 
depth of understanding, the authors have broken new ground 
and have produced a thoroughly readable book. It is en- 
livened by a common theme that runs throughout: Marxian 
prophecy of the violent course of human events as popula- 
tion presses on land and its nonfulfillment in this case. 
According to Marx, “At a certain stage of development, 
the material forces of production in society come into 
conflict with the property relations within which they 

had been at work before.” The authors share with Marx an 
awareness of this conflict, but not his notion of an abrupt 
and complete transformation of the entire institutional 
framework. Village stratification is the revealed mechanism 
for resolving that conflict. Radical political economists 

are identified as perpetrators of the notion that new tech- 
nologies (such as MVs) are major factors in promoting 
inequality and class differentiation. 


I find the authors’ verdict of the MV’s absolute innocence 
of this charge welcome news indeed; however, they might 
have avoided highlighting the implications for radical 
economic doctrine and prophecy. Doctrines long held 
succumb only to overwhelming evidence to the contrary. 
The authors’ building-block approach is the right one at 


this stage for advancing understanding of a complex and 
challenging problem. But their approach is inadequate 

to the task of challenging doctrine, because it also employs 
the method of advocacy. The authors conclude that villages 
“do change in response to changes in the relative scarcity 
of resources, but the change takes a very long time, often 
generations.” Rival purveyors of doctrine have little appre- 
ciation for that fact, as their interventions sometimes 
grossly demonstrate. I think the fundamental development 
assistance policy prescription which flows from this book 
is to proceed with caution and to check your assumptions. 
Village polarization increases social instability. Rapid 
development of markets may enhance this instability. But, 
greater mobility of resources and the consequent special- 
ization of production and exchange of goods which accom- 
panies market development is everywhere a fundamental 
characteristic of economic growth. 
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Economic development over the last three decades has been 
undertaken largely as a collection of disconnected projects. 
Therefore, the need for an integrative theory that deals 

with the fundamental causes of the distribution of income, 
wealth, and rights is one of the two major premises on which 
this book is based. The other is that the integrative con- 
cepts were extremely general and abstract. This situation 

left the practitioners of economic development in a vacuum, 
and they could only approach a global framework in a rather 
remote fashion. 


The author is a professor of agriculture and'resource eco- 
nomics at the University of California. His book draws on 
an extensive bibliography and is supported by his own 
experience and observation from extended stays in Argen- 
tina, Chile, Colombia, and Peru, as well as frequent visits 

to Ecuador, Mexico, Brazil, and the Dominican Republic. 
He examines and builds numerous theories and postulates 
of Marx, Ricardo, Keynes, Lenin, Kautzky, and others, and 
he draws on the thoughts of contemporary analysts includ- 
ing Prebish, Schultz, Ruttan, and Schuh. His analysis is 
developed from a broad base of observation and theory, 
which leads him to conclude that any proposal to eliminate 
poverty and improve the distribution of income within 

and among nations needs to originate in a positive analysis 
of the multiple sources of unequal development. He fails 
to find satisfaction in neoclassical economics and chooses 
the perspective cf political economy to integrate social, 
political, and economic thought—an essential element in 
understanding the multiple forces that underlie the demand 
for agragrian reform. De Janvry uses this comprehensive 
framework to analyze the underlying cause of change in 
the interdependence between large landowners and pea- 
sants, mechanization and the shift to capitalism, land 
reform in its various aspects, urbanization and labor 
availability, economic growth and the unequal distribu- 
tion of income and wealth, the persistence of poverty, 

and other changes in the structure and performance of 
agriculture in Latin America. 


Based on this integrated theory, de Janvry distinguishes 
and examines in detail the changes in agriculture during 
two recent periods of transformation in Latin America— 
the fifties, which were marked by import-substitution 

in industry at a time when strong vestiges of noncapital- 
istic or feudalistic social relations in agriculture resisted 
agrarian reform, and the sixties and seventies, when 
capitalistic modes of production were introduced to 
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agriculture, thereby redefining the role of the peasantry 
in bringing about shifts in political power and changes 
in social interdependence. 


Within this integrative framework, de Janvry describes 
and analyzes these areas: 


@ The forces underlying the agrarian crises in several 
Latin American countries; 

@ The stagnation of food production prior to the 
sixties and the uneven development of capitalist 
and peasant agriculture in the ensuing period; 

The instability of the position of the peasant class 
and the dynamics of its transformation; 

The conflicts during the transition to capitalism 
between rents and profits, between cheap food 
and foreign exchange, and between peasant labor 
and mechanization; 

The varied nature of land reform and the transition 
from precapitalist to capitalist agriculture; and 
The political and economic purposes underlying 
some of the rural development programs and their 
often contradictory nature. 


De Janvry concludes with a critical examination of several 
current policies of world agencies which purport to stimulate 
equitable agrarian growth. These policies include: 


@ The International Labor Organization proposal 
for labor-intensive development. 

@ The World Bank and the U.S. Agency for Inter- 
national Development’s promotion of integrated 
rural development with emphasis on small farmers. 
The Overseas Development Council’s suggestion 
that calls for national governments to provide an 
improved standard of welfare for the poor. 


The author finds that each of these proposals has short- 
comings as an agrarian reform strategy because it fails to 
deal with the fundamental causes of poverty and maldis- 
tribution of income. He subscribes to a strategy of “‘basic 
needs,” but not one that would act as an income constraint 
on growth. Instead, he emphasizes one that would achieve 
those political, social, and economic structural changes 
needed not only to reconcile growth and distribution but 
also make the satisfaction of basic needs the essential pur- 
pose of economic growth. As economic growth occurs and 
personal income increases, the basic needs would be re- 
defined beyond mere subsistence and would be met with 
rising productivity of labor. 
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Although one may agree or disagree with de Janvry’s final analysis of agrarian issues. Whether or not it persuades 
prescription for economic development, the book is well the practitioners will depend on their agreement with the 
written and succeeds in developing a framework for an author’s fundamental values and views. 








On the Misuse of Theil’s Inequality Coefficient 


By Raymond M. Leuthold* 








In a recent article, “The Food and Agricultural Policy 
Simulator,” by Salathe, Price, and Gadson in this journal 
(4),' the authors use a Theil inequality coefficient (p. 7, 
equation (12)) to validate the performance of their model. 
They used actual values of Y; and Y;- 1 when computing 
this statistic. The statistic was never intended to be used 
in this way because it is sensitive to additive transforma- 
tions. That is, if one moves the decimal point to the right 
for each variable, one can generate lower coefficients. Theil 
(5) pointed out that instead of actual values, only changes 
in the variables should be used. A thorough discussion of 
the appropriateness of Theil coefficients is contained in 
Leuthold (2). 


Salathe, Price, and Gadson cite two references: Pindyck and 
Rubinfeld (3) and Kost (1). The former never mentions 

this problem, but the latter goes so far as to prove the 
additive feature. A U in actual values will always be less 
than a U when data are measured in changes. Kost went 

on to compute U, Uj, and Us, and as expected, 

U < Uj < Ug in all but three cases. No explanation is 
offered for these exceptions. 


Salathe, Price, and Gadson do correctly point out their 
statistic MARE is independent of the units of measure of 
Y. They go on to imply they want another statistic 
bounded from above by 1, but the question is: Why 
impose this restriction? One suspects the prime reason 

is to generate low U coefficients. (Again, see (3) on this 
point.) Their results seem to reflect the unit problem. With 
the exception of oats, note that the U cofficients for prices 
of grains (wheat, corn, barley, sorghum, soybeans, and 
cotton) are all higher than those for prices of meat products 
(barrows and gilts, steers, cows, broilers, turkeys, eggs, 

and milk). I suspect this occurs because all the products 

in the latter group are priced in units that are roughly 10 
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times the units of price for the former group. The U statistic 
would generally predict those relationships. (Milk seems to 
be another odd case here.) Also, the authors point out that 
U statistics are higher for prices than for corresponding 
production variables. Could that again be simply a reflection 
of units and not relative performance? (Perplexing here is 
that MARE had a similar tendency, and it is not unit- 
sensitive). 


Salathe, Price, and Gadson have defined and used a U 
coefficient in a manner that Theil never intended. Theil 
defined it differently from the very beginning in terms of 
the type of data inputs. Kost used this coefficent in 1980, 
but he covered himself by also computing U and Uo. The 
U’s reported by Salathe, Price, and Gadson are probably 
much too low, reflecting greater accuracy than is really 
the case. Were they to publish an addendum to their paper 
where they also show Uj and Us, I would expect some 
changes in relative results among commodities, and I would 
expect higher coefficients reflecting the true performance 
errors. 
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Comment on the Misuse of Theil’s Inequality Coefficient 


By William E. Kost* 








The unexplained inconsistency in U, Uj, and Ug mentioned 
by Leuthold in the validation results for my net trade model 
was due to arithmetic errors. The original calculations were 
done by hand because no computer programs were then 
available in ERS for making this calculation. When the 
errors were corrected, U<U;<U2 —as Leuthold predicted. 


However, there is another way an inconsistency may show 
up. It involves the definitions of the inequality coeffi- 
cients.’ U, and Ud use lagged values of the variable. U 
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where Y is the actual value, Y is the estimated value, and 
T is the number of periods. 


does not. When using all possible data in calculating Theil 
inequality coefficients, the U calculation, therefore, uses 
one more observation than U; and Ug. Depending on the 
magnitude of the error in this first observation, the 
U<Uj<Ug relationship may not hold. 


The emphasis Leuthold places on the cross comparison of 
the three coefficients appears too strong. Although similar, 
these three coefficients measure different things. Their 
value is in pointing to ways to improve a model. Does the 
revised model generate a better U (or U;, or U2)? The 
coefficients give users some idea of how good a model is. No 
one measure of goodness of fit is satisfactory. All measures 
normally used provide some useful information. The con- 
clusion a user (or model builder) draws from the informa- 
tion is subjective. Though it has an additive bias, the U 
statistic is still a useful measure, particularly when one is 
comparing two versions of the same model. In that case, 
no scale problem exists. To use it, of course, one must 
know how to interpret it. But, the same is true for all other 
measures of goodness of fit. 
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